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95 Human Secreted Proteins 

Field of the Invention 

This invention relates to newly identified polynucleotides and the 
polypeptides encoded by these polynucleotides, uses of such polynucleotides and 
5 polypeptides, and their production. 

Background of the Invention 
Unlike bacterium, which exist as a single compartment surrounded by a 
membrane, human cells and other eucaryotes are subdivided by membranes into many 
functionally distinct compartments. Each membrane-bounded compartment, or 
10 organelle, contains different proteins essential for the function of the organelle. The 
cell uses "sorting signals," which are amino acid motifs located within the protein, to 
target proteins to particular cellular organelles. 

One type of sorting signal, called a signal sequence, a signal peptide, or a 
leader sequence, directs a class of proteins to an organelle called the endoplasmic 
15 reticulum (ER). The ER separates the membrane-bounded proteins from all other 
types of proteins. Once localized to the ER, both groups of proteins can be further 
directed to another organelle called the Golgi apparatus. Here, the Golgi distributes 
the proteins to vesicles, including secretory vesicles, the cell membrane, lysosomes, 
and the other organelles. 
20 Proteins targeted to the ER by a signal sequence can be released into the 

extracellular space as a secreted protein. For example, vesicles containing secreted 
proteins can fuse with the cell membrane and release their contents into the 
extracellular space - a process called exocytosis. Exocytosis can occur constitutively 
or after receipt of a triggering signal. In the latter case, the proteins are stored in 
25 secretory vesicles (or secretory granules) until exocytosis is triggered. Similarly, 
proteins residing on the cell membrane can also be secreted into the extracellular 
space by proteolytic cleavage of a "linker" holding the protein to the membrane. 

Despite the great progress made in recent years, only a small number of genes 
encoding human secreted proteins have been identified. These secreted proteins 
30 include the commercially valuable human insulin, interferon, Factor VIII, human 



BNSDOCID: <WO 9947540A1_I_> 



WO 99/47540 



PCTAJS99/05804 



growth hormone, tissue plasminogen activator, and erythropoeitin. Thus, in light of 
the pervasive role of secreted proteins in human physiology, a need exists for 
identifying and characterizing novel human secreted proteins and the genes that 
encode them. This knowledge will allow one to detect, to treat, and to prevent 
5 medical disorders by using secreted proteins or the genes that encode them. 

Summary of the Invention 

The present invention relates to novel polynucleotides and the encoded 
polypeptides. Moreover, the present invention relates to vectors, host cells, 
10 antibodies, and recombinant methods for producing the polypeptides and 

polynucleotides. Also provided are diagnostic methods for detecting disorders related 
to the polypeptides, and therapeutic methods for treating such disorders. The 
invention further relates to screening methods for identifying binding partners of the 
polypeptides. 

15 

Detailed Description 

Definitions 

The following definitions are provided to facilitate understanding of certain 
terms used throughout this specification. 

20 In the present invention, "isolated" refers to material removed from its original 

environment (e.g., the natural environment if it is naturally occurring), and thus is 
altered "by the hand of man" from its natural state. For example, an isolated 
polynucleotide could be part of a vector or a composition of matter, or could be 
contained within a cell, and still be "isolated" because that vector, composition of 

25 matter, or particular cell is not the original environment of the polynucleotide. 

In the present invention, a "secreted" protein refers to those proteins capable 
of being directed to the ER, secretory vesicles, or the extracellular space as a result of 
a signal sequence, as well as those proteins released into the extracellular space 
without necessarily containing a signal sequence. If the secreted protein is released 

30 into the extracellular space, the secreted protein can undergo extracellular processing 
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to produce a "mature" protein. Release into the extracellular space can occur by many 
mechanisms, including exocytosis and proteolytic cleavage. 

In specific embodiments, the polynucleotides of the invention are less than 
300 kb, 200 kb, 100 kb, 50 kb, 15 kb, 10 kb, or 7.5 kb in length. In a further 
5 embodiment, polynucleotides of the invention comprise at least 15 contiguous 
nucleotides of the coding sequence, but do not comprise all or a portion of any intron. 
In another embodiment, the nucleic acid comprising the coding sequence does not 
contain coding sequences of a genomic flanking gene (i.e., 5* or 3' to the gene in the 
genome). 

10 As used herein , a "polynucleotide" refers to a molecule having a nucleic acid 

sequence contained in SEQ ID NO:X or the cDNA contained within the clone 
deposited with the ATCC. For example, the polynucleotide can contain the 
nucleotide sequence of the full length cDNA sequence, including the 5' and 3' 
untranslated sequences, the coding region, with or without the signal sequence, the 

15 secreted protein coding region, as well as fragments, epitopes, domains, and variants 
of the nucleic acid sequence. Moreover, as used herein, a "polypeptide" refers to a 
molecule having the translated amino acid sequence generated from the 
polynucleotide as broadly defined. 

In the present invention, the full length sequence identified as SEQ ID NO:X 

20 was often generated by overlapping sequences contained in multiple clones (contig 
analysis). A representative clone containing all or most of the sequence for SEQ ID 
NO:X was deposited with the American Type Culture Collection ("ATCC"). As 
shown in Table 1, each clone is identified by a cDNA Clone ID (Identifier) and the 
ATCC Deposit Number. The ATCC is located at 10801 University Boulevard, 

25 Manassas, Virginia 201 10-2209, USA. The ATCC deposit was made pursuant to the 
terms of the Budapest Treaty on the international recognition of the deposit of 
microorganisms for purposes of patent procedure. 

A "polynucleotide" of the present invention also includes those 
polynucleotides capable of hybridizing, under stringent hybridization conditions, to 

30 sequences contained in SEQ ID NO:X, the complement thereof, or the cDNA within 
the clone deposited with the ATCC. "Stringent hybridization conditions" refers to an 
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overnight incubation at 42° C in a solution comprising 50% formamide, 5x SSC (750 
mM NaCl, 75 mM sodium citrate), 50 mM sodium phosphate (pH 7.6), 5x Denhardt's 
solution, 10% dextran sulfate, and 20 p-g/ml denatured, sheared salmon sperm DNA, 
followed by washing the filters in O.lx SSC at about 65°C. 

Also contemplated are nucleic acid molecules that hybridize to the 
polynucleotides of the present invention at lower stringency hybridization conditions. 
Changes in the stringency of hybridization and signal detection are primarily 
accomplished through the manipulation of formamide concentration (lower 
percentages of formamide result in lowered stringency); salt conditions, or 
temperature. For example, lower stringency conditions include an overnight 
incubation at 37°C in a solution comprising 6X SSPE (20X SSPE = 3M NaCl: 0.2M 
NaH 2 P0 4 ; 0.02M EDTA, pH 7.4), 0.5% SDS, 30% formamide, 100 ug/ml salmon 
sperm blocking DNA; followed by washes at 50°C with 1XSSPE, 0.1% SDS. In 
addition, to achieve even lower stringency, washes performed following stringent 
hybridization can be done at higher salt concentrations (e.g. 5X SSC). 

Note that variations in the above conditions may be accomplished through the 
inclusion and/or substitution of alternate blocking reagents used to suppress 
background in hybridization experiments. Typical blocking reagents include 
Denhardt's reagent, BLOTTO, heparin, denatured salmon sperm DNA, and 
commercially available proprietary formulations. The inclusion of specific blocking 
reagents may require modification of the hybridization conditions described above, 
due to problems with compatibility. 

Of course, a polynucleotide which hybridizes only to polyA+ sequences (such 
as any 3' terminal polyA+ tract of a cDNA shown in the sequence listing), or to a 
complementary stretch of T (or U) residues, would not be included in the definition of 
"polynucleotide," since such a polynucleotide would hybridize to any nucleic acid 
molecule containing a poly (A) stretch or the complement thereof (e.g., practically 
any double-stranded cDNA clone). 

The polynucleotide of the present invention can be composed of any 
polyribonucleotide or polydeoxribonucleotide, which may be unmodified RNA or 
DNA or modified RNA or DNA. For example, polynucleotides can be composed of 
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single- and double-stranded DNA, DNA that is a mixture of single- and double- 
stranded regions, single- and double-stranded RNA, and RNA that is mixture of 
single- and double-stranded regions, hybrid molecules comprising DNA and RNA 
that may be single-stranded or, more typically, double-stranded or a mixture of single- 
5 and double-stranded regions. In addition, the polynucleotide can be composed of 
triple-stranded regions comprising RNA or DNA or both RNA and DNA. A 
polynucleotide may also contain one or more modified bases or DNA or RNA 
backbones modified for stability or for other reasons. "Modified" bases include, for 
example, tritylated bases and unusual bases such as inosine. A variety of 

10 modifications can be made to DNA and RNA; thus, "polynucleotide" embraces 
chemically, enzymatically, or metabolically modified forms. 

The polypeptide of the present invention can be composed of amino acids 
joined to each other by peptide bonds or modified peptide bonds, i.e., peptide 
isosteres, and may contain amino acids other than the 20 gene-encoded amino acids. 

15 The polypeptides may be modified by either natural processes, such as 

posttranslational processing, or by chemical modification techniques which are well 
known in the art. Such modifications are well described in basic texts and in more 
detailed monographs, as well as in a voluminous research literature. Modifications 
can occur anywhere in a polypeptide, including the peptide backbone, the amino acid 

20 side-chains and the amino or carboxyl termini. It will be appreciated that the same 
type of modification may be present in the same or varying degrees at several sites in 
a given polypeptide. Also, a given polypeptide may contain many types of 
modifications. Polypeptides may be branched , for example, as a result of 
ubiquitination, and they may be cyclic, with or without branching. Cyclic, branched, 

25 and branched cyclic polypeptides may result from posttranslation natural processes or 
may be made by synthetic methods. Modifications include acetylation, acylation, 
ADP-ribosylation, amidation, covalent attachment of flavin, covalent attachment of a 
heme moiety, covalent attachment of a nucleotide or nucleotide derivative, covalent 
attachment of a lipid or lipid derivative, covalent attachment of phosphotidylinositol, 

30 cross-linking, cyclization, disulfide bond formation, demethylation, formation of 

covalent cross-links, formation of cysteine, formation of pyroglutamate, formylation, 
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gamma-carboxylation, glycosylation, GPI anchor formation, hydroxylation, 
iodination, methylation, myristoylation, oxidation, pegylation, proteolytic processing, 
phosphorylation, prenylation, racemization, selenoylation, sulfation, transfer-RNA 
mediated addition of amino acids to proteins such as arginylation, and ubiquitination. 
5 (See, for instance, PROTEINS - STRUCTURE AND MOLECULAR PROPERTIES, 
2nd Ed., T. E. Creighton, W. H. Freeman and Company, New York (1993); 
POSTTRANSLATIONAL COVALENT MODIFICATION OF PROTEINS, B. C. 
Johnson, Ed., Academic Press, New York, pgs. 1-12 (1983); Seifter et al., Meth 
Enzymol 182:626-646 (1990); Rattan et al., Ann NY Acad Sci 663:48-62 (1992).) 
10 "SEQ ID NO:X" refers to a polynucleotide sequence while "SEQ ID NO:Y n 

refers to a polypeptide sequence, both sequences identified by an integer specified in 
Table 1. 

"A polypeptide having biological activity" refers to polypeptides exhibiting 
activity similar, but not necessarily identical to, an activity of a polypeptide of the 

15 present invention, including mature forms, as measured in a particular biological 
assay, with or without dose dependency. In the case where dose dependency does 
exist, it need not be identical to that of the polypeptide, but rather substantially similar 
to the dose-dependence in a given activity as compared to the polypeptide of the 
present invention (i.e., the candidate polypeptide will exhibit greater activity or not 

20 more than about 25-fold less and, preferably, not more than about tenfold less 

activity, and most preferably, not more than about three-fold less activity relative to 
the polypeptide of the present invention.) 

Polynucleotides and Polypeptides of the Invention 

25 

FEATURES OF PROTEIN ENCODED BY GENE NO: 1 

This gene is expressed primarily in anergic T cells and merkel cells. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
30 biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, immune disorders and inflammatory diseases. Similarly, polypeptides 
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and antibodies directed to these polypeptides are useful in providing immunological 
probes for differential identification of the tissue(s) or cell type(s). For a number of 
disorders of the above tissues or cells, particularly of the immune system, expression 
of this gene at significantly higher or lower levels may be routinely detected in certain 
5 tissues or cell types (e.g., immune, cancerous and wounded tissues) or bodily fluids 
(e.g., lymph, serum, plasma, urine, synovial fluid and spinal fluid) or another tissue or 
cell sample taken from an individual having such a disorder, relative to the standard 
gene expression level, i.e., the expression level in healthy tissue or bodily fluid from 
an individual not having the disorder. 
10 Preferred epitopes include those comprising a sequence shown in SEQ ID NO: 

108 as residues: Ala-55 to Gln-64. 

The tissue distribution in T-cells and merkel cells indicates that the protein 
products of this gene are useful for the diagnosis and/or treatment of immune system 
diseases. Furthermore, 

15 Expression of this gene product in T-cells indicates a role in the regulation of 

the proliferation; survival; differentiation; and/or activation of potentially all 
hematopoietic cell lineages, including blood stem cells. This gene product may be 
involved in the regulation of cytokine production, antigen presentation, or other 
processes that may also suggest a usefulness in the treatment of cancer (e.g. by 

20 boosting immune responses). 

Since the gene is expressed in cells of lymphoid origin, the gene or protein, as 
well as, antibodies directed against the protein may show utility as a tumor marker 
and/or immunotherapy targets for the above listed tissues. Therefore it may be also 
used as an agent for immunological disorders including arthritis, asthma, immune 

25 deficiency diseases such as AIDS, leukemia, rheumatoid arthritis, inflammatory 
bowel disease, sepsis, acne, and psoriasis. In addition, this gene product may have 
commercial utility in the expansion of stem cells and committed progenitors of 
various blood lineages, and in the differentiation and/or proliferation of various cell 
types. Protein, as well as, antibodies directed against the protein may show utility as a 

30 tumor marker and/or immunotherapy targets for the above listed tissues. 
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Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO: 1 1 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
5 excluded from the scope of the present invention. To list every related sequence is 
cumbersome. Accordingly, preferably excluded from the present invention are one or 
more polynucleotides comprising a nucleotide sequence described by the general 
formula of a-b, where a is any integer between 1 to 2329 of SEQ ID NO: 1 1, b is an 
integer of 15 to 2343, where both a and b correspond to the positions of nucleotide 
10 residues shown in SEQ ID NO: 1 1 , and where b is greater than or equal to a + 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 2 

In specific embodiments, polypeptides of the invention comprise the following 
amino acid sequences: IPENRRPASXCTWSMWTSRTTTRRPPWGRFSSVSSASV 

15 SSTRKTWRTRSTSCCRSSRRRVAAPFCTPSASTEPSARMEPPLELPVVHTFSFL 
TFVFTYRCSAGDGSITQINCAYEMGEEMPKRQMKAIKFLLFHFYL (SEQ ID 
NO:205), IPENRRPASXCTWSMWTSRTTTRRPPWGRFSSVSS ASVSST (SEQ ID 
NO:206), RKTWRTRSTSCCRSSRRRVAAPFCTPSASTEPSARMEPPLELP (SEQ 
ID NO:207), and/or VVHTFSFLTFVFTYRCSAGDGSITQINCAYEMGEEMPKRQ 

20 MKAIKFLLFHFYL (SEQ ID NO:208). Polynucleotides encoding these polypeptides 
are also encompassed by the invention. 

This gene is expressed primarily in placental, brain and breast tissues, and to a 
lesser extent in T cells and tumors. 

Therefore, polynucleotides and polypeptides of the invention are useful as 

25 reagents for differential identification of the tissue(s) or cell type(s) present in a 

biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, neurodegenerative and/or endocrine disorders and neoplasias, or 
developmental disorders. Similarly, polypeptides and antibodies directed to these 
polypeptides are useful in providing immunological probes for differential 

30 identification of the tissue(s) or cell type(s). For a number of disorders of the above 
tissues or cells, particularly of the neurodegenerative, developing, endocrine and 
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immune systems, expression of this gene at significantly higher or lower levels may 
be routinely detected in certain tissues or cell types (e.g., brain, endocrine, immune, 
developing, cancerous and wounded tissues) or bodily fluids (e.g., lymph, serum, 
plasma, urine, synovial fluid and spinal fluid) or another tissue or cell sample taken 
5 from an individual having such a disorder, relative to the standard gene expression 
level, i.e., the expression level in healthy tissue or bodily fluid from an individual not 
having the disorder. 

Preferred epitopes include those comprising a sequence shown in SEQ ID NO: 
109 as residues: Ala-55 to Asn-60, Lys-65 to Met-71, Leu-75 to Asn-86, Asp-93 to 
10 Asp-1 10, Leu-130 to Cys-138, Gln-149 to Glu-154, Thr-172 to Ile-179, Glu-185 to 
Arg-192. 

The tissue distribution in breast and brain tissues indicates that the protein 
products of this gene are useful for the diagnosis and/or treatment of endocrine 
disorders, neurodegenerative disorders, developmental disorders, immune system 

15 diseases and neoplasias. The tissue distribution in placental tissue indicates that 
polynucleotides and polypeptides corresponding to this gene are useful for the 
diagnosis and/or treatment of disorders of the placenta. Specific expression within the 
placenta indicates that this gene product may play a role in the proper establishment 
and maintenance of placental function. Alternately, this gene product may be 

20 produced by the placenta and then transported to the embryo, where it may play a 
crucial role in the development and/or survival of the developing embryo or fetus. 

Expression of this gene product in a vascular-rich tissue such as the placenta 
also indicates that this gene product may be produced more generally in endothelial 
cells or within the circulation. In such instances, it may play more generalized roles in 

25 vascular function, such as in angiogenesis. It may also be produced in the vasculature 
and have effects on other cells within the circulation, such as hematopoietic cells. It 
may serve to promote the proliferation, survival, activation, and/or differentiation of 
hematopoietic cells, as well as other cells throughout the body. Likewise, 

Expression of this gene product in T-cells indicates a role in the regulation of 

30 the proliferation; survival; differentiation; and/or activation of potentially all 

hematopoietic cell lineages, including blood stem cells. This gene product may be 
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involved in the regulation of cytokine production, antigen presentation, or other 
processes that may also suggest a usefulness in the treatment of cancer (e.g. by 
boosting immune responses). 

Since the gene is expressed in cells of lymphoid origin, the gene or protein, as 
well as, antibodies directed against the protein may show utility as a tumor marker 
and/or immunotherapy targets for the above listed tissues. Therefore it may. be also 
used as an agent for immunological disorders including arthritis, asthma, immune 
deficiency diseases such as AIDS, leukemia, rheumatoid arthritis, inflammatory 
bowel disease, sepsis, acne, and psoriasis. In addition, this gene product may have 
commercial utility in the expansion of stem cells and committed progenitors of 
various blood lineages, and in the differentiation and/or proliferation of various cell 
types. Alternatively, the tissue distribution in brain tissue indicates that 
polynucleotides and polypeptides corresponding to this gene are useful for the 
detection/treatment of neurodegenerative disease states and behavioural disorders 
such as Alzheimers Disease, Parkinsons Disease, Huntingtons Disease, Tourette 
Syndrome, schizophrenia, mania, dementia, paranoia, obsessive compulsive disorder, 
panic disorder, learning disabilities, ALS, psychoses, autism, and altered behaviors, 
including disorders in feeding, sleep patterns, balance, and perception. In addition, the 
gene or gene product may also play a role in the treatment and/or detection of 
developmental disorders associated with the developing embryo, or sexually-linked 
disorders. Protein, as well as, antibodies directed against the protein may show utility 
as a tumor marker and/or immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO: 12 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence is 
cumbersome. Accordingly, preferably excluded from the present invention are one or 
more polynucleotides comprising a nucleotide sequence described by the general 
formula of a-b, where a is any integer between 1 to 1 163 of SEQ ID NO: 12, b is an 
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integer of 15 to 1 177, where both a and b correspond to the positions of nucleotide 
residues shown in SEQ ID NO: 12, and where b is greater than or equal to a + 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 3 

5 The translation product of this gene shares sequence homology with bovine 

beta-mannosidase, which is thought to be important in lysosomal catabolism of 
glycoproteins. See, for example, J. Biol. Chem. 270, 3841-3848 (1995), incorporated 
herein by reference in its entirety. Based on the sequence similarity between these 
proteins the translation product of this gene will sometimes hereinafter be reffered to 

10 as human beta-mannosidase. Human beta-mannosidase is expected to share certain 
biological activities, particularly enzymatic activities, with bovine beta-mannosidase. 
Such activities may be assayed by methods known in the art, described in J. Biol. 
Chem. 270, 3841-3848 (1995), and/or disclosed elsewhere herein. 

In specific embodiments, polypeptides of the invention comprise the following 

15 amino acid sequences: HPSIIIWSGNNENEEALMMNWYHISFTDRPIYIKDYVTL 
YVKNIRELVLAGDKSRPFITSSPTNGAETVAEAWVSQNPNSNYFGDVHFYDYI 
SDCWNWKVFPKARFASEYGYQSWPSFSTLEKVSSTEDWSFNSKFSLHRQHH 
EGGNKQMLYQAGLHFKLPQSTDPLRTFKDTIYLTQVMQAQCVKTETEFYRRS 
RSEIVDQQGHTMGALYWQLNDIWQAPSW (SEQ ID NO:209), and/or 

20 VRVHTWS 

SLEPVCSRVTERFVMKGGEAVCLYEEPVSELLRRCGNCTRESCVVSFYLSAD 
HELLSPTNYHFLSSPKEAVGLCKAQITAIISQQGDIFVFDLETSAVAPFVWLDV 
GSIPGRFSDNGFLMTEKTRTILFYPWEPTSKNELEQSFHVTSLTDIY (SEQ ID 
NO:210). Polynucleotides encoding these polypeptides are also encompassed by the 

25 invention. The gene encoding the disclosed cDNA is thought to reside on 

chromosome 4. Accordingly, polynucleotides related to this invention are useful as a 
marker in linkage analysis for chromosome 4. 

This gene is expressed primarily in colon tissue, and to a lesser extent in 
thymus stromal cells and chondrosarcoma tissue. 

30 Therefore, polynucleotides and polypeptides of the invention are useful as 

reagents for differential identification of the tissue(s) or cell type(s) present in a 
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biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, chondroma and mannosidosis. Similarly, polypeptides and antibodies 
directed to these polypeptides are useful in providing immunological probes for 
differential identification of the tissue(s) or cell type(s). For a number of disorders of 
the above tissues or cells, particularly of the chondro and immune system. The 
expression of this gene at significantly higher or lower levels may be routinely 
detected in certain tissues or cell types (e.g., immune, metabolic, cancerous and 
wounded tissues) or bodily fluids (e.g., lymph, serum, plasma, urine, synovial fluid 
and spinal fluid) or another tissue or cell sample taken from an individual having such 
a disorder, relative to the standard gene expression level, i.e., the expression level in 
healthy tissue or bodily fluid from an individual not having the disorder. 

The tissue distribution and homology to bovine beta-mannosidase indicates 
that the protein products of this gene are useful for the diagnosis and/or treatment of 
chondroma and mannosidosis. Human beta-mannosidosis is an autosomal recessive, 
lysosomal storage disease caused by a deficiency of the enzyme beta-mannosidase. 
Furthermore, the homology of the translation product of this gene to beta- 
mannosidase indicates that polynucleotides and polypeptides corresponding to this 
gene are useful for the diagnosis, prevention, and/or treatment of various metabolic 
disorders such as lysosomal storage deficiencies, Tay-Sachs disease, 
phenylkenonuria, galactosemia, hyperlipidemias, porphyrias, and Hurler's syndrome. 
Protein, as well as, antibodies directed against the protein may show utility as a tumor 
marker and/or immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO: 13 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence is 
cumbersome. Accordingly, preferably excluded from the present invention are one or 
more polynucleotides comprising a nucleotide sequence described by the general 
formula of a-b, where a is any integer between 1 to 2093 of SEQ ID NO: 13, b is an 
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integer of 15 to 2107, where both a and b correspond to the positions of nucleotide 
residues shown in SEQ ID NO: 13, and where b is greater than or equal to a + 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 4 

In specific embodiments, polypeptides of the invention comprise the following 
amino acid sequences: PRLTPRMKWPTAALASRLLGWTVLRPPYPRVPSLPQVT 
LHPTDGLMAVLYTGGEGRTLGEQHFFHETFVTRWLLGPVPVRFGACSPLSFL 
APRRGQGAPAGXFCACPRPASRQLCPWPALPGTPYSNSAPLCTGMGHSNTPQ 
GPPS PQ Y ALSPTEPTSLSGNSHLPAIL VL (SEQ ID NO:211), 
PRLTPRMKWPTAAL ASRLLGWTVLRPPYPRVPSLPQVTLHP (SEQ ID 
NO:212), TDGLMAVLYTGGE GRTLGEQHFFHETFVTRWLLGPVPVRFG (SEQ 
ID NO:213), ACSPLSFLAPRRGQGAPAGXFCACPRPAS RQLCPWPALPGTP 
(SEQ ID NO:214), and/or 

YSNSAPLCTGMGHSNTPQGPPSPQYALSPTEPTSLSGNS HLPAILVL (SEQ ID 
NO:215). Polynucleotides encoding these polypeptides are also encompassed by the 
invention. 

This gene is expressed primarily in human lung (adult and fetal), and to a 
lesser extent in liver and brain tissues. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, pulmonary disorders and hemostasis. Similarly, polypeptides and 
antibodies directed to these polypeptides are useful in providing immunological 
probes for differential identification of the tissue(s) or cell type(s). For a number of 
disorders of the above tissues or cells, particularly of the lung and liver tissues, 
expression of this gene at significantly higher or lower levels may be routinely 
detected in certain tissues or cell types (e.g., pulmonary, cancerous and wounded 
tissues) or bodily fluids (e.g., lymph, serum, plasma, urine, synovial fluid and spinal 
fluid) or another tissue or cell sample taken from an individual having such a 
disorder, relative to the standard gene expression level, i.e., the expression level in 
healthy tissue or bodily fluid from an individual not having the disorder. 
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Preferred epitopes include those comprising a sequence shown in SEQ ID NO: 
1 1 1 as residues: Arg-28 to Gln-36. 

The tissue distribution in lung and liver tissues indicates that the protein 
products of this gene are useful for the diagnosis and/or treatment of pulmonary 
5 disorders and hematopoietic disorders. The tissue distribution in adult and fetal lung 
tissues indicates that polynucleotides and polypeptides corresponding to this gene are 
useful for the detection and treatment of disorders associated with developing lungs, 
particularly in premature infants where the lungs are the last tissues to develop. The 
tissue distribution indicates that polynucleotides and polypeptides corresponding to 
10 this gene are useful for the diagnosis and intervention of lung tumors, since the gene 
may be involved in the regulation of cell division, particularly since it is expressed in 
fetal tissue. Alternatively, 

Expression of this gene product in liver tissue indicates a role in the regulation 
of the proliferation; survival; differentiation; and/or activation of potentially all 
15 hematopoietic cell lineages, including blood stem cells. This gene product may be 
involved in the regulation of cytokine production, antigen presentation, or other 
processes that may also suggest a usefulness in the treatment of cancer (e.g. by 
boosting immune responses). 

Since the gene is expressed in cells of lymphoid origin, the gene or protein, as 
20 well as, antibodies directed against the protein may show utility as a tumor marker 
and/or immunotherapy targets for the above listed tissues. Therefore it may be also 
used as an agent for immunological disorders including arthritis, asthma, immune 
deficiency diseases such as AIDS, leukemia, rheumatoid arthritis, inflammatory 
bowel disease, sepsis, acne, and psoriasis. In addition, this gene product may have 
25 commercial utility in the expansion of stem cells and committed progenitors of 

various blood lineages, and in the differentiation and/or proliferation of various cell 
types. Protein, as well as, antibodies directed against the protein may show utility as a 
tumor marker and/or immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
30 available and accessible through sequence databases. Some of these sequences are 

related to SEQ ID NO: 14 and may have been publicly available prior to conception of 
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the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence is 
cumbersome. Accordingly, preferably excluded from the present invention are one or 
more polynucleotides comprising a nucleotide sequence described by the general 
5 formula of a-b, where a is any integer between 1 to 1248 of SEQ ID NO: 14, b is an 
integer of 15 to 1262, where both a and b correspond to the positions of nucleotide 
residues shown in SEQ ID NO: 14, and where b is greater than or equal to a + 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 5 
10 In specific embodiments, polypeptides of the invention comprise the following 

amino acid sequences: HLLEVTPCRLPVPEFPGRTPRGSRTPD (SEQ ID NO:216). 
Polynucleotides encoding these polypeptides are also encompassed by the invention. 

This gene is expressed primarily in rapidly dividing liver tissue, (e.g., 
hepatoma, hepatocellular carcinoma, and fetal liver tissue), and to a lesser extent in 

15 normal liver tissue, and other tumors such as colon cancer and uterine cancer. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, cancers, particularly hepatomas, colon cancer, and uterine cancer. 

20 Similarly, polypeptides and antibodies directed to these polypeptides are useful in 
providing immunological probes for differential identification of the tissue(s) or cell 
type(s). For a number of disorders of the above tissues or cells, particularly of the 
liver, colon and uterus, expression of this gene at significantly higher or lower levels 
may be routinely detected in certain tissues or cell types (e.g., liver, colon, uterus, 

25 cancerous and wounded tissues) or bodily fluids (e.g., lymph, serum, plasma, urine, 
synovial fluid and spinal fluid) or another tissue or cell sample taken from an 
individual having such a disorder, relative to the standard gene expression level, i.e., 
the expression level in healthy tissue or bodily fluid from an individual not having the 
disorder. 
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Preferred epitopes include those comprising a sequence shown in SEQ ID NO: 
1 12 as residues: Trp-35 to Trp-45, Pro-52 to Asp-57, Thr-73 to Arg-82, Pro- 105 to 
Leu- 1 12, Pro-1 15 to Arg-127, Pro- 140 to Gin- 151. 

The tissue distribution in liver tissues and cancers thereof, as well as other 
cancerous tissues, indicates that the protein products of this gene are useful for the 
diagnosis and/or treatment of cancers, particularly, hepatoma, colon cancer and 
uterine cancer, as well as cancers of other tissues where expression has been 
observed. Furthermore, expression within cellular sources marked by proliferating 
cells indicates that this protein may play a role in the regulation of cellular division, 
and may show utility in the diagnosis and treatment of cancer and other proliferative 
disorders. Thus, this protein may also be involved in apoptosis or tissue 
differentiation and could again be useful in cancer therapy. Protein, as well as, 
antibodies directed against the protein may show utility as a tumor marker and/or 
immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO: 15 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence is 
cumbersome. Accordingly, preferably excluded from the present invention are one or 
more polynucleotides comprising a nucleotide sequence described by the general 
formula of a-b, where a is any integer between 1 to 745 of SEQ ID NO: 15, b is an 
integer of 15 to 759, where both a and b correspond to the positions of nucleotide 
residues shown in SEQ ID NO: 15, and where b is greater than or equal to a + 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 6 

This gene is expressed primarily in hepatocellular tumors. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, hepatomas. Similarly, polypeptides and antibodies directed to these 
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polypeptides are useful in providing immunological probes for differential 
identification of the tissue(s) or cell type(s). For a number of disorders of the above 
tissues or cells, particularly of the liver, expression of this gene at significantly higher 
or lower levels may be routinely detected in certain tissues or cell types (e.g., liver, 
5 cancerous and wounded tissues) or bodily fluids (e.g., lymph, serum, plasma, urine, 
synovial fluid and spinal fluid) or another tissue or cell sample taken from an 
individual having such a disorder, relative to the standard gene expression level, i.e., 
the expression level in healthy tissue or bodily fluid from an individual not having the 
disorder. 

10 Preferred epitopes include those comprising a sequence shown in SEQ ID NO: 

1 13 as residues: Pro-32 to Gly-40. 

The tissue distribution in hepatocellular tumors indicates that the protein 
products of this gene are useful for the diagnosis and/or treatment of hepatomas, as 
well as cancers of other tissues where expression has been observed. Furthermore, 

15 expression within cellular sources marked by proliferating cells indicates that this 
protein may play a role in the regulation of cellular division, and may show utility in 
the diagnosis and treatment of cancer and other proliferative disorders. Thus, this 
protein may also be involved in apoptosis or tissue differentiation and could again be 
useful in cancer therapy. Protein, as well as, antibodies directed against the protein 

20 may show utility as a tumor marker and/or immunotherapy targets for the above listed 
tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO: 16 and may have been publicly available prior to conception of 

25 the present invention. Preferably, such related polynucleotides are specifically 

excluded from the scope of the present invention. To list every related sequence is 
cumbersome. Accordingly, preferably excluded from the present invention are one or 
more polynucleotides comprising a nucleotide sequence described by the general 
formula of a-b, where a is any integer between 1 to 1796 of SEQ ID NO: 16, b is an 

30 integer of 15 to 1810, where both a and b correspond to the positions of nucleotide 
residues shown in SEQ ID NO: 16, and where b is greater than or equal to a + 14. 
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FEATURES OF PROTEIN ENCODED BY GENE NO: 7 

This gene is expressed primarily in human rhabdomyosarcoma tissue, as well 
as in placental tissue. 

5 Therefore, polynucleotides and polypeptides of the invention are useful as 

reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, malignant neoplasms and reproductive disorders. Similarly, 
polypeptides and antibodies directed to these polypeptides are useful in providing 

10 immunological probes for differential identification of the tissue(s) or cell type(s). For 
a number of disorders of the above tissues or cells, particularly of the skeletal system 
and reproductive system, expression of this gene at significantly higher or lower 
levels may be routinely detected in certain tissues or cell types (e.g., reproductive, 
cancerous and wounded tissues) or bodily fluids (e.g., lymph, serum, plasma, urine, 

15 synovial fluid and spinal fluid) or another tissue or cell sample taken from an 

individual having such a disorder, relative to the standard gene expression level, i.e., 
the expression level in healthy tissue or bodily fluid from an individual not having the 
disorder. 

Preferred epitopes include those comprising a sequence shown in SEQ ID NO: 
20 1 14 as residues: Arg-23 to Trp-28, Phe-93 to Lys-98, Arg-199 to Trp-206, Gly-208 to 
Met-213. 

The tissue distribution in placental tissue and human rhabdomyosarcoma 
tissue indicates that the protein products of this gene are useful for the diagnosis 
and/or treatment of skeletal and reproductive disorders. Furthermore, the tissue 

25 distribution in placental tissue indicates that polynucleotides and polypeptides 

corresponding to this gene are useful for the diagnosis and/or treatment of disorders 
of the placenta. Specific expression within the placenta indicates that this gene 
product may play a role in the proper establishment and maintenance of placental 
function. Alternately, this gene product may be produced by the placenta and then 

30 transported to the embryo, where it may play a crucial role in the development and/or 
survival of the developing embryo or fetus. 
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Expression of this gene product in a vascular-rich tissue such as the placenta 
also indicates that this gene product may be produced more generally in endothelial 
cells or within the circulation. In such instances, it may play more generalized roles in 
vascular function, such as in angiogenesis. It may also be produced in the vasculature 
5 and have effects on other cells within the circulation, such as hematopoietic cells. It 
may serve to promote the proliferation, survival, activation, and/or differentiation of 
hematopoietic cells, as well as other cells throughout the body. Protein, as well as, 
antibodies directed against the protein may show utility as a tumor marker and/or 
immunotherapy targets for the above listed tissues. 

10 Many polynucleotide sequences, such as EST sequences, are publicly 

available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO: 1 7 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence is 

15 cumbersome. Accordingly, preferably excluded from the present invention are one or 
more polynucleotides comprising a nucleotide sequence described by the general 
formula of a-b, where a is any integer between 1 to 1038 of SEQ ID NO: 17, b is an 
integer of 15 to 1052, where both a and b correspond to the positions of nucleotide 
residues shown in SEQ ID NO: 17, and where b is greater than or equal to a + 14. 

20 

FEATURES OF PROTEIN ENCODED BY GENE NO: 8 

This gene is expressed primarily in fetal liver/spleen and fetal skin tissues, and 
to a lesser extent in breast cancer tissue. 

Therefore, polynucleotides and polypeptides of the invention are useful as 

25 reagents for differential identification of the tissue(s) or cell type(s) present in a 

biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, developmental disorders and neoplasias. Similarly, polypeptides and 
antibodies directed to these polypeptides are useful in providing immunological 
probes for differential identification of the tissue(s) or cell type(s). For a number of 

30 disorders of the above tissues or cells, particularly of the fetal tissue and adult 

immune system, expression of this gene at significantly higher or lower levels may be 
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routinely detected in certain tissues or cell types (e.g., developing, immune, cancerous 
and wounded tissues) or bodily fluids (e.g., lymph, serum, plasma, urine, synovial 
fluid and spinal fluid) or another tissue or cell sample taken from an individual having 
such a disorder, relative to the standard gene expression level, i.e., the expression 
5 level in healthy tissue or bodily fluid from an individual not having the disorder. 

The tissue distribution in fetal liver/spleen and skin tissues indicates that the 
protein products of this gene are useful for the diagnosis and/or treatment of 
developmental disorders and malignant neoplasias. Likewise, expression within fetal 
tissue and other cellular sources marked by proliferating cells indicates that this 
10 protein may play a role in the regulation of cellular division, and may show utility in 
the diagnosis and treatment of cancer and other proliferative disorders. Similarly, fetal 
development also involves decisions involving cell differentiation and/or apoptosis in 
pattern formation. Thus, this protein may also be involved in apoptosis or tissue 
differentiation and could again be useful in cancer therapy. 
I 5 Alternatively, the tissue distribution in fetal skin tissue indicates that 

polynucleotides and polypeptides corresponding to this gene are useful for the 
treatment, diagnosis, and/or prevention of various skin disorders including congenital 
disorders (i.e. nevi, moles, freckles, Mongolian spots, hemangiomas, port-wine 
syndrome), integumentary tumors (i.e. keratoses, Bowen's disease, basal cell 
20 carcinoma, squamous cell carcinoma, malignant melanoma, Paget' s disease, mycosis 
fungoides, and Kaposi's sarcoma), injuries and inflammation of the skin (i.e.wounds, 
rashes, prickly heat disorder, psoriasis, dermatitis), atherosclerosis, uticaria, eczema, 
photosensitivity, autoimmune disorders (i.e. lupus erythematosus, vitiligo, 
dermatomyositis, morphea, scleroderma, pemphigoid, and pemphigus), keloids, striae, 
25 erythema, petechiae, purpura, and xanthelasma. Moreover, such disorders may 

predispose increased susceptibility to viral and bacterial infections of the skin (i.e. 
cold sores, warts, chickenpox, molluscum contagiosum, herpes zoster, boils, cellulitis, 
erysipelas, impetigo, tinea, althletes foot, and ringworm). Protein, as well as, 
antibodies directed against the protein may show utility as a tumor marker and/or 
30 immunotherapy targets for the above listed tissues. 
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Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO: 18 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
5 excluded from the scope of the present invention. To list every related sequence is 
cumbersome. Accordingly, preferably excluded from the present invention are one or 
more polynucleotides comprising a nucleotide sequence described by the general 
formula of a-b, where a is any integer between 1 to 1 1 16 of SEQ ID NO: 18, b is an 
integer of 15 to 1 130, where both a and b correspond to the positions of nucleotide 
10 residues shown in SEQ ID NO: 18, and where b is greater than or equal to a + 14. 



FEATURES OF PROTEIN ENCODED BY GENE NO: 9 

The translation product of this gene shares sequence homology with the 
bacterial guf A gene, as well as a C. elegans protein of unknown function. 

15 In specific embodiments, polypeptides of the invention comprise the following 

amino acid sequences: MIPGSDSQTALNFGSTLMKKKSDPEGPALLFPESELSIRI 
GRAGLLSDKSENGEAYQRKKAAATGLPEGPAVPVPSRGNLAQPGGSSWRRI 
ALLILAITIHNVPEGLAVGVGFGAIEKTASATFESARNLAIGIGIQNFPEGLAVS 
LPLRGAGFSTWRAFWYGQLSGMVEPLAGVFGAFAVVLAEPILPYALAFAAG 

20 AMVYVVMDDIIPEAQISGNGKLASWASILGFVVMMSLDVGLG (SEQ ID 
NO:217), MIPGSDSQTALNFGSTLMKKKSDPEGPALLFPESELSIRIGRA (SEQ 
ID NO:218), GLLSDKSENGEAYQRKKAAATGLPEGPAVPVPSRGNLAQPG 
(SEQ ID N O : 2 1 9 ) , 

GSSWRRIALLILAITIHNVPEGLAVGVGFGAIEKTASATFESAR (SEQ ID 

25 NO:220), NLAIGIGIQNFPEGLAVSLPLRGAGFSTWRAFWYGQLS GMVEP 
(SEQ ID NO:221), LAGVFGAFAVVLAEPILPYALAFAAGAMVYVVM 
DDIIPEAQIS (SEQ ID NO:222), and/or GNGKLASWASILGFVVMMSLDVGLG 
(SEQ ID NO:223). Polynucleotides encoding these polypeptides are also 
encompassed by the invention. 

30 This gene is expressed primarily in cells of the immune system, particularly 

macrophage. 
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Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, disorders of the immune system, such as AIDS, as well as 
inflammatory disorders. Similarly, polypeptides and antibodies directed to these 
polypeptides are useful in providing immunological probes for differential 
identification of the tissue(s) or cell type(s). For a number of disorders of the above 
tissues or cells, particularly of the immune system, expression of this gene at 
significantly higher or lower levels may be routinely detected in certain tissues or cell 
types (e.g., immune, cancerous and wounded tissues) or bodily fluids (e.g., lymph, 
serum, plasma, urine, synovial fluid and spinal fluid) or another tissue or cell sample 
taken from an individual having such a disorder, relative to the standard gene 
expression level, i.e., the expression level in healthy tissue or bodily fluid from an 
individual not having the disorder. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for the diagnosis and treatment of a variety of 
immune system disorders. Expression of this gene product in immune cells such as 
macrophage indicates a role in the regulation of the proliferation; survival; 
differentiation; and/or activation of potentially all hematopoietic cell lineages, 
including blood stem cells. This gene product may be involved in the regulation of 
cytokine production, antigen presentation, or other processes that may also suggest a 
usefulness in the treatment of cancer (e.g. by boosting immune responses). 

Since the gene is expressed in cells of lymphoid origin, the gene or protein, as 
well as, antibodies directed against the protein may show utility as a tumor marker 
and/or immunotherapy targets for the above listed tissues. Therefore it may be also 
used as an agent for immunological disorders including arthritis, asthma, immune 
deficiency diseases such as AIDS, leukemia, rheumatoid arthritis, inflammatory 
bowel disease, sepsis, acne, and psoriasis. In addition, this gene product may have 
commercial utility in the expansion of stem cells and committed progenitors of 
various blood lineages, and in the differentiation and/or proliferation of various cell 
types. Expression of this gene product in macrophage also strongly indicates a role 
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for this protein in immune function and immune surveillance. Protein, as well as, 
antibodies directed against the protein may show utility as a tumor marker and/or 
immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
5 available and accessible through sequence databases. Some of these sequences are 

related to SEQ ID NO: 19 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence is 
cumbersome. Accordingly, preferably excluded from the present invention are one or 
10 more polynucleotides comprising a nucleotide sequence described by the general 
formula of a-b, where a is any integer between 1 to 869 of SEQ ID NO: 19, b is an 
integer of 15 to 883, where both a and b correspond to the positions of nucleotide 
residues shown in SEQ ID NO: 19, and where b is greater than or equal to a + 14. 



1 5 FEATURES OF PROTEIN ENCODED BY GENE NO: 10 

This gene is expressed primarily in the spleen metastic melanoma tissue as 
well as in embryonic tissues. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 

20 biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, disorders affecting the spleen or immune system, developmental 
disorders, and cancers. Similarly, polypeptides and antibodies directed to these 
polypeptides are useful in providing immunological probes for differential 
identification of the tissue(s) or cell type(s). For a number of disorders of the above 

25 tissues or cells, particularly of the immune system, expression of this gene at 

significantly higher or lower levels may be routinely detected in certain tissues or cell 
types (e.g., spleen, developing, cancerous and wounded tissues) or bodily fluids (e.g., 
lymph, serum, plasma, urine, synovial fluid and spinal fluid) or another tissue or cell 
sample taken from an individual having such a disorder, relative to the standard gene 

30 expression level, i.e., the expression level in healthy tissue or bodily fluid from an 
individual not having the disorder. 
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Preferred epitopes include those comprising a sequence shown in SEQ ID NO: 
1 17 as residues: Asn-37 to Lys-44, Ser-73 to Glu-78, Ala-103 to Ser-1 11. 

The tissue distribution in spleen metastic melanoma and embryonic tissues 
indicates that the protein products of this gene are useful for the diagnosis and/or 
5 treatment of disorders affecting the spleen, including cancers of the spleen, as well as 
cancers of other tissues where expression has been observed. Furthermore, expression 
within embryonic tissue and other cellular sources marked by proliferating cells 
indicates that this protein may play a role in the regulation of cellular division, and 
may show utility in the diagnosis and treatment of cancer and other proliferative 
10 disorders. Similarly, embryonic development also involves decisions involving cell 
differentiation and/or apoptosis in pattern formation. Thus, this protein may also be 
involved in apoptosis or tissue differentiation and could again be useful in cancer 
therapy. Protein, as well as, antibodies directed against the protein may show utility as 
a tumor marker and/or immunotherapy targets for the above listed tissues. 
15 Many polynucleotide sequences, such as EST sequences, are publicly 

available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:20 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence is 
20 cumbersome. Accordingly, preferably excluded from the present invention are one or 
more polynucleotides comprising a nucleotide sequence described by the general 
formula of a-b, where a is any integer between 1 to 975 of SEQ ID NO:20, b is an 
integer of 15 to 989, where both a and b correspond to the positions of nucleotide 
residues shown in SEQ ID NO:20, and where b is greater than or equal to a + 14. 

25 

FEATURES OF PROTEIN ENCODED BY GENE NO: 11 

It has been discovered that this gene is expressed primarily in cells of the 

immune system, including monocytes and neutrophils. 

Therefore, nucleic acids of the invention are useful as reagents for differential 
30 identification of the tissue(s) or cell type(s) present in a biological sample and for 

diagnosis of the following diseases and conditions: disorders affecting the immune 
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system, such as AIDS. Similarly, polypeptides and antibodies directed to those 
polypeptides are useful to provide immunological probes for differential identification 
of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells, 
particularly of the immune system, expression of this gene at significantly higher or 
5 lower levels may be detected in certain tissues or cell types (e.g., immune, cancerous 
and wounded tissues) or bodily fluids (e.g., lymph, serum, plasma, urine, synovial 
fluid or spinal fluid) taken from an individual having such a disorder, relative to the 
standard gene expression level, i.e., the expression level in healthy tissue from an 
individual not having the disorder. 

10 Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 

1 18 as residues: Ser-12 to Asp-20, Gly-22 to Gly-32, Ala-49 to Thr-57. 

The tissue distribution in monocytes and neutrophils indicates that the protein 
products of this clone are useful for the diagnosis and/or treatment of immune system 
disorders, including AIDS. Furthermore, expression of this gene product in 

15 monocytes and neutrophils suggests a role in the regulation of the proliferation; 
survival; differentiation; and/or activation of potentially all hematopoietic cell 
lineages, including blood stem cells. This gene product may be involved in the 
regulation of cytokine production, antigen presentation, or other processes that may 
also suggest a usefulness in the treatment of cancer (e.g. by boosting immune 

20 responses). 

Since the gene is expressed in cells of lymphoid origin, the gene or protein, as 
well as, antibodies directed against the protein may show utility as a tumor marker 
and/or immunotherapy targets for the above listed tissues. Therefore it may be also 
used as an agent for immunological disorders including arthritis, asthma, immune 

25 deficiency diseases such as AIDS, leukemia, rheumatoid arthritis, inflammatory 
bowel disease, sepsis, acne, and psoriasis. In addition, this gene product may have 
commercial utility in the expansion of stem cells and committed progenitors of 
various blood lineages, and in the differentiation and/or proliferation of various cell 
types. Expression of this gene product in monocytes and neutrophils also strongly 

30 suggests a role for this protein in immune function and immune surveillance. Protein, 
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as well as, antibodies directed against the protein may show utility as a tumor marker 
and/or immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
5 related to SEQ ID NO:21 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence 
would be cumbersome. Accordingly, preferably excluded from the present invention 
are one or more polynucleotides comprising a nucleotide sequence described by the 
10 general formula of a-b, where a is any integer between 1 to 481 of SEQ ID NO:21, b 
is an integer of 15 to 495, where both a and b correspond to the positions of 
nucleotide residues shown in SEQ ID NO:21 , and where b is greater than or equal to a 
+ 14. 

15 FEATURES OF PROTEIN ENCODED BY GENE NO: 12 

It has been discovered that this gene is expressed primarily in cells of the 
immune system, including monocytes. 

Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the tissue(s) or cell type(s) present in a biological sample and for 

20 diagnosis of the following diseases and conditions: disorders affecting the immune 
system. Similarly, polypeptides and antibodies directed to those polypeptides are 
useful to provide immunological probes for differential identification of the tissue(s) 
or cell type(s). For a number of disorders of the above tissues or cells, particularly of 
the immune system, expression of this gene at significantly higher or lower levels 

25 may be detected in certain tissues or cell types (e.g., immune, cancerous and wounded 
tissues) or bodily fluids (e.g., lymph, serum, plasma, urine, synovial fluid or spinal 
fluid) taken from an individual having such a disorder, relative to the standard gene 
expression level, i.e., the expression level in healthy tissue from an individual not 
having the disorder. 

30 Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 

1 19 as residues: Glu-35 to Trp-42. 
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The tissue distribution suggests that the protein product of this clone is useful 
for the diagnosis and treatment of a variety of immune system disorders. Expression 
of this gene product in monocytes suggests a role in the regulation of the 
proliferation; survival; differentiation; and/or activation of potentially all 
5 hematopoietic cell lineages, including blood stem cells. This gene product may be 
involved in the regulation of cytokine production, antigen presentation, or other 
processes that may also suggest a usefulness in the treatment of cancer (e.g. by 
boosting immune responses). 

Since the gene is expressed in cells of lymphoid origin, the gene or protein, as 

10 well as, antibodies directed against the protein may show utility as a tumor marker 
and/or immunotherapy targets for the above listed tissues. Therefore it may be also 
used as an agent for immunological disorders including arthritis, asthma, immune 
deficiency diseases such as AIDS, leukemia, rheumatoid arthritis, inflammatory 
bowel disease, sepsis, acne, and psoriasis. In addition, this gene product may have 

15 commercial utility in the expansion of stem cells and committed progenitors of 

various blood lineages, and in the differentiation and/or proliferation of various cell 
types. Expression of this gene product in monocytes also strongly suggests a role for 
this protein in immune function and immune surveillance. Protein, as well as, 
antibodies directed against the protein may show utility as a tumor marker and/or 

20 immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO: 22 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 

25 excluded from the scope of the present invention. To list every related sequence 

would be cumbersome. Accordingly, preferably excluded from the present invention 
are one or more polynucleotides comprising a nucleotide sequence described by the 
general formula of a-b, where a is any integer between 1 to 2303 of SEQ ID NO:22, b 
is an integer of 15 to 2317, where both a and b correspond to the positions of 

30 nucleotide residues shown in SEQ ID NO:22, and where b is greater than or equal to a 
+ 14. 
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FEATURES OF PROTEIN ENCODED BY GENE NO: 13 

It has been discovered that this gene is expressed primarily in cells of the 
immune system, including monocytes. 
5 Therefore, nucleic acids of the invention are useful as reagents for differential 

identification of the tissue(s) or cell type(s) present in a biological sample and for 
diagnosis of the following diseases and conditions: disorders of the immune system. 
Similarly, polypeptides and antibodies directed to those polypeptides are useful to 
provide immunological probes for differential identification of the tissue(s) or cell 
10 type(s). For a number of disorders of the above tissues or cells, particularly of the 

immune system, expression of this gene at significantly higher or lower levels may be » 1 
detected in certain tissues or cell types (e.g., immune, cancerous and wounded tissues) 
or bodily fluids (e.g., lymph, serum, plasma, urine, synovial fluid or spinal fluid) 
taken from an individual having such a disorder, relative to the standard gene 
15 expression level, i.e., the expression level in healthy tissue from an individual not 
having the disorder. 

The tissue distribution in monocytes indicates that the protein products of this 
clone are useful for the diagnosis and/or treatment of disorders of the immune system. 
Expression of this gene product in monocytes suggests a role in the regulation of the 
20 proliferation; survival; differentiation; and/or activation of potentially all 

hematopoietic cell lineages, including blood stem cells. This gene product may be 
involved in the regulation of cytokine production, antigen presentation, or other 
processes that may also suggest a usefulness in the treatment of cancer (e.g. by 
boosting immune responses). 
25 Since the gene is expressed in cells of lymphoid origin, the gene or protein, as 

well as, antibodies directed against the protein may show utility as a tumor marker 
and/or immunotherapy targets for the above listed tissues. Therefore it may be also 
used as an agent for immunological disorders including arthritis, asthma, immune 
deficiency diseases such as AIDS, leukemia, rheumatoid arthritis, inflammatory 
30 bowel disease, sepsis, acne, and psoriasis. In addition, this gene product may have 
commercial utility in the expansion of stem cells and committed progenitors of 
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various blood lineages, and in the differentiation and/or proliferation of various cell 
types. Expression of this gene product in monocytes also strongly suggests a role for 
this protein in immune function and immune surveillance. Protein, as well as, 
antibodies directed against the protein may show utility as a tumor marker and/or 
immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:23 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence 
would be cumbersome. Accordingly, preferably excluded from the present invention 
are one or more polynucleotides comprising a nucleotide sequence described by the 
general formula of a-b, where a is any integer between 1 to 1712 of SEQ ID NO:23, b 
is an integer of 15 to 1726, where both a and b correspond to the positions of 
nucleotide residues shown in SEQ ID NO:23, and where b is greater than or equal to a 
+ 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 14 

The translation product of this gene shares sequence homology with a gene 
from C. elegans of unknown function. 

In specific embodiments, polypeptides of the invention comprise the following 
amino acid sequences: TRPITYVLLAG (SEQ ID NO:224). Polynucleotides encoding 
these polypeptides are also encompassed by the invention. The gene encoding the 
disclosed cDNA is thought to reside on chromosome 1 1 . Accordingly, 
polynucleotides related to this invention are useful as a marker in linkage analysis for 
chromosome 11. 

It has been discovered that this gene is expressed primarily in fetal lung, liver, 
spleen and heart tissues, as well as adult liver, bladder, endometrial stromal cells, 
synovium, colon cancer, smooth muscle, keratinocytes, and the bone marrow derived 
cell line RS4;11. 
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Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the tissue(s) or cell type(s) present in a biological sample and for 
diagnosis of the following diseases and conditions: disorders of the musculo-skeletal 
system, and cancers of the immune system. Similarly, polypeptides and antibodies 
directed to those polypeptides are useful to provide immunological probes for 
differential identification of the tissue(s) or cell type(s). For a number of disorders of 
the above tissues or cells, particularly of the musculo-skeletal and immune systems, 
expression of this gene at significantly higher or lower levels may be detected in 
certain tissues or cell types (e.g., immune, musculo-skeletal, cancerous and wounded 
tissues) or bodily fluids (e.g., lymph, serum, plasma, urine, synovial fluid or spinal 
fluid) taken from an individual having such a disorder, relative to the standard gene 
expression level, i.e., the expression level in healthy tissue from an individual not 
having the disorder. 

The tissue distribution in tissues of the immune system indicates that the 
protein products of this clone are useful for treating proliferative disorders of immune 
system precursor cells. Alternatively, the tissue distribution in smooth muscle and 
heart tissue indicates that the protein product of this gene is useful for the diagnosis 
and treatment of conditions and pathologies of the cardiovascular system, such as 
heart disease, restenosis, atherosclerosis, stoke, angina, thrombosis, and wound 
healing. Protein, as well as, antibodies directed against the protein may show utility as 
a tumor marker and/or immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:24 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence 
would be cumbersome. Accordingly, preferably excluded from the present invention 
are one or more polynucleotides comprising a nucleotide sequence described by the 
general formula of a-b, where a is any integer between 1 to 5 15 of SEQ ID NO:24, b 
is an integer of 15 to 529, where both a and b correspond to the positions of 
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nucleotide residues shown in SEQ ID NO:24, and where b is greater than or equal to a 
+ 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 15 

5 In specific embodiments, polypeptides of the invention comprise the following 

amino acid sequences: GTSLTAPLLEFLLALYFLFADAMQLNDKWQGLCWP 
(SEQ ID NO:225). Polynucleotides encoding these polypeptides are also 
encompassed by the invention. 

It has been discovered that this gene is expressed primarily in T-cells, fetal 
10 spleen and infant brain tissues, and to a lesser extent in many other tissues including 
melanocytes, lung cancer, macrophages, dendritic cells, stromal cells, adrenal gland 
and others. 

Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the tissue(s) or cell type(s) present in a biological sample and for 

15 diagnosis of the following diseases and conditions: inflammation and autoimmunity, 
developing tissues. Similarly, polypeptides and antibodies directed to those 
polypeptides are useful to provide immunological probes for differential identification 
of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells, 
particularly of the immune and developing system, expression of this gene at 

20 significantly higher or lower levels may be detected in certain tissues or cell types 
(e.g., immune, developing, cancerous and wounded tissues) or bodily fluids (e.g., 
lymph, serum, plasma, urine, synovial fluid or spinal fluid) taken from an individual 
having such a disorder, relative to the standard gene expression level, i.e., the 
expression level in healthy tissue from an individual not having the disorder. 

25 Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 

122 as residues: Ser-46 to Gly-51. 

The tissue distribution in T-cells and other immune cells indicates that the 
protein products of this clone are useful for treating diseases involving the activation 
of T-cells, including inflammation and autoimmune diseases. Alternatively, the tissue 

30 distribution in a wide range of fetal tissues suggests that this protein may play a role 
in the regulation of cellular division, and may show utility in the diagnosis and 
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treatment of cancer and other proliferative disorders. Similarly, fetal development 
also involves decisions involving cell differentiation and/or apoptosis in pattern 
formation. Thus, this protein may also be involved in apoptosis or tissue 
differentiation and could again be useful in cancer therapy. Protein, as well as, 
5 antibodies directed against the protein may show utility as a tumor marker and/or 
immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:25 and may have been publicly available prior to conception of 

10 the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence 
would be cumbersome. Accordingly, preferably excluded from the present invention 
are one or more polynucleotides comprising a nucleotide sequence described by the 
general formula of a-b, where a is any integer between 1 to 1741 of SEQ ID NO:25, b 

15 is an integer of 15 to 1755, where both a and b correspond to the positions of 

nucleotide residues shown in SEQ ID NO:25, and where b is greater than or equal to a 
+ 14. 



FEATURES OF PROTEIN ENCODED BY GENE NO: 16 

20 In specific embodiments, polypeptides of the invention comprise the following 

amino acid sequences: LANFZCSDCAQTVLFVLZFZILVFTYEIPF (SEQ ID 
NO:226). Polynucleotides encoding these polypeptides are also encompassed by the 
invention. The gene encoding the disclosed cDNA is thought to reside on 
chromosome 13. Accordingly, polynucleotides related to this invention are useful as a 

25 marker in linkage analysis for chromosome 13. Recently another group published this 
gene, referring to it as CLN5 (See Genbank Accession No.: 3342386). 

It has been discovered that this gene is expressed primarily in placental tissue, 
12 week embryos, and tumors including testes, tongue and pharynx, and to a lesser 
extent in adipose tissue, tonsils, melanocytes, fetal spleen, macrophages, T-cells, 

30 amniotic cells, and brain tissue. 
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Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the tissue(s) or cell type(s) present in a biological sample and for 
diagnosis of the following diseases and conditions: tumors, particularly of the tongue 
and throat, and neurodegenerative disorders. Similarly, polypeptides and antibodies 
5 directed to those polypeptides are useful to provide immunological probes for 

differential identification of the tissue(s) or cell type(s). For a number of disorders of 
the above tissues or cells, particularly of the neural and digestive systems, expression 
of this gene at significantly higher or lower levels may be detected in certain tissues 
or cell types (e.g., tongue, throat, brain, cancerous and wounded tissues) or bodily 

10 fluids (e.g., lymph, serum, plasma, urine, synovial fluid or spinal fluid) taken from an 
individual having such a disorder, relative to the standard gene expression level, i.e., 
the expression level in healthy tissue from an individual not having the disorder. 

Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 
123 as residues: Pro-44 to Ala-60, Val-187 to Thr-193, Lys-203 to Ala-210, Thr-212 

15 toCys-219 r 

The tissue distribution in tongue and pharynx carcinoma tissue indicates that 
the protein products of this clone are useful for diagnosing and/or treating oral 
cancers, including tumors of the throat and tongue. Furthermore, the tissue 
distribution in brain tissue suggests that the protein product of this clone is useful for 

20 the detection/treatment of neurodegenerative disease states and behavioural disorders 
such as neuronal ceroid lipofuscinoses (NCLs), Alzheimers Disease, Parkinsons 
Disease, Huntingtons Disease, Tourette Syndrome, schizophrenia, mania, dementia, 
paranoia, obsessive compulsive disorder, panic disorder, learning disabilities, ALS, 
psychoses, autism, and altered behaviors, including disorders in feeding, sleep 

25 patterns, balance, and perception. In addition, the gene or gene product may also play 
a role in the treatment and/or detection of developmental disorders associated with the 
developing embryo, or sexually-linked disorders. Protein, as well as, antibodies 
directed against the protein may show utility as a tumor marker and/or 
immunotherapy targets for the above listed tissues. 

30 Many polynucleotide sequences, such as EST sequences, are publicly 

available and accessible through sequence databases. Some of these sequences are 
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related to SEQ ID NO:26 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence 
would be cumbersome. Accordingly, preferably excluded from the present invention 
5 are one or more polynucleotides comprising a nucleotide sequence described by the 
general formula of a-b, where a is any integer between 1 to 1737 of SEQ ID NO:26, b 
is an integer of 15 to 175 1 , where both a and b correspond to the positions of 
nucleotide residues shown in SEQ ID NO:26, and where b is greater than or equal to a 
+ 14. 

10 

FEATURES OF PROTEIN ENCODED BY GENE NO: 17 

In specific embodiments, polypeptides of the invention comprise the following 
amino acid sequences: 

QAWHEVGGGVRRCWFVLGERRAGSLLSASYGTFAMPG 

1 5 MVLFGRRWAIASDDLVFPGFFELVVRVLWWIGILTLYL (SEQ ID NO:227), 

and/or PGMVLFGRRWAIASDDLVFPGFFELVVRVLWWIGILTLYLMHRGKLD 
CAGGALLSSYLIVLMILLAVVICTVSAIMCVSMRGTICNPGPRKSMSKLLYIRL 
ALFFPEMVWASLGAAWVADGVQCD (SEQ ID NO:228). Polynucleotides 
encoding these polypeptides are also encompassed by the invention. 

20 It has been discovered that this gene is expressed in activated neutrophils, 

infant brain tissue and primary dendritic cells. 

Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the tissue(s) or cell type(s) present in a biological sample and for 
diagnosis of the following diseases and conditions: disorders of the immune system, 

25 and neurodegenerative disorders. Similarly, polypeptides and antibodies directed to 
those polypeptides are useful to provide immunological probes for differentia] 
identification of the tissue(s) or cell type(s). For a number of disorders of the above 
tissues or cells, particularly of the immune and neural systems, expression of this 
gene at significantly higher or lower levels may be detected in certain tissues or cell 

30 types (e.g., immune, brain, cancerous and wounded tissues) or bodily fluids (e.g., 

lymph, serum, plasma, urine, synovial fluid or spinal fluid) taken from an individual 
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having such a disorder, relative to the standard gene expression level, i.e., the 
expression level in healthy tissue from an individual not having the disorder. 

Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 
124 as residues: Pro-47 to Met-53, Ser-130 to Ser-138. 
5 The tissue distribution in neutrophils and primary dendritic cells indicates that 

the protein products of this clone are useful for diagnosing and/or treating immune 
system disorders. Expression of this gene product in neutrophils and primary dendritic 
cells suggests a role in the. regulation of the proliferation; survival; differentiation; 
and/or activation of potentially all hematopoietic cell lineages, including blood stem 

10 cells. This gene product may be involved in the regulation of cytokine production, 
antigen presentation, or other processes that may also suggest a usefulness in the 
treatment of cancer (e.g. by boosting immune responses). 

Since the gene is expressed in cells of lymphoid origin, the gene or protein, as 
well as, antibodies directed against the protein may show utility as a tumor marker 

15 and/or immunotherapy targets for the above listed tissues. Therefore it may be also 
used as an agent for immunological disorders including arthritis, asthma, immune 
deficiency diseases such as AIDS, leukemia, rheumatoid arthritis, inflammatory 
bowel disease, sepsis, acne, and psoriasis. In addition, this gene product may have 
commercial utility in the expansion of stem cells and committed progenitors of 

20 various blood lineages, and in the differentiation and/or proliferation of various cell 
types. Expression of this gene product in neutrophils and primary dendritic cells also 
strongly suggests a role for this protein in immune function and immune surveillance. 

Alternatively, the tissue distribution in brain tissue suggests that the protein 
product of this clone is useful for the detection/treatment of neurodegenerative 

25 disease states and behavioural disorders such as Alzheimers Disease, Parkinsons 

Disease, Huntingtons Disease, Tourette Syndrome, schizophrenia, mania, dementia, 
paranoia, obsessive compulsive disorder, panic disorder, learning disabilities, ALS, 
psychoses, autism, and altered behaviors, including disorders in feeding, sleep 
patterns, balance, and perception. In addition, the gene or gene product may also play 

30 a role in the treatment and/or detection of developmental disorders associated with the 
developing embryo, or sexually-linked disorders. Protein, as well as, antibodies 
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directed against the protein may show utility as a tumor marker and/or 
immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
5 related to SEQ ID NO:27 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence 
would be cumbersome. Accordingly, preferably excluded from the present invention 
are one or more polynucleotides comprising a nucleotide sequence described by the 
10 general formula of a-b, where a is any integer between 1 to 1 198 of SEQ ID NO:27, b 
is an integer of 15 to 1212, where both a and b correspond to the positions of 
nucleotide residues shown in SEQ ID NO:27, and where b is greater than or equal to a 
+ 14. 

15 FEATURES OF PROTEIN ENCODED BY GENE NO: 18 

It has been discovered that this gene is expressed primarily in neutrophils, and 
to a lesser extent in other tissues. 

Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the tissue(s) or cell type(s) present in a biological sample and for 

20 diagnosis of the following diseases and conditions: immune and inflammatory 

disorders. Similarly, polypeptides and antibodies directed to those polypeptides are 
useful to provide immunological probes for differential identification of the tissue(s) 
or cell type(s). For a number of disorders of the above tissues or cells, particularly of 
the immune system, expression of this gene at significantly higher or lower levels 

25 may be detected in certain tissues or cell types (e.g., immune, cancerous and wounded 
tissues) or bodily fluids (e.g., lymph, serum, plasma, urine, synovial fluid or spinal 
fluid) taken from an individual having such a disorder, relative to the standard gene 
expression level, i.e., the expression level in healthy tissue from an individual not 
having the disorder. 

30 Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 

125 as residues: Gin- 17 to Ser-24. 
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The tissue distribution in neutrophils indicates that the protein products of this 
clone are useful for the diagnosis and/or treatment of immune and inflammatory 
disorders. Expression of this gene product in neutrophils suggests a role in the 
regulation of the proliferation; survival; differentiation; and/or activation of 
5 potentially all hematopoietic cell lineages, including blood stem cells. This gene 
product may be involved in the regulation of cytokine production, antigen 
presentation, or other processes that may also suggest a usefulness in the treatment of 
cancer (e.g. by boosting immune responses). 

Since the gene is expressed in cells of lymphoid origin, the gene or protein, as 

10 well as, antibodies directed against the protein may show utility as a tumor marker 
and/or immunotherapy targets for the above listed tissues. Therefore it may be also 
used as an agent for immunological disorders including arthritis, asthma, immune 
deficiency diseases such as AIDS, leukemia, rheumatoid arthritis, inflammatory 
bowel disease, sepsis, acne, and psoriasis. In addition, this gene product may have 

15 commercial utility in the expansion of stem cells and committed progenitors of 

various blood lineages, and in the differentiation and/or proliferation of various cell 
types. Expression of this gene product in neutrophils also strongly suggests a role for 
this protein in immune function and immune surveillance. Protein, as well as, 
antibodies directed against the protein may show utility as a tumor marker and/or 

20 immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:28 and may have been publicly available prior to conception- of 
the present invention. Preferably, such related polynucleotides are specifically 

25 excluded from the scope of the present invention. To list every related sequence 

would be cumbersome. Accordingly, preferably excluded from the present invention 
are one or more polynucleotides comprising a nucleotide sequence described by the 
general formula of a-b, where a is any integer between 1 to 1098 of SEQ ID NO:28, b 
is an integer of 15 to 1112, where both a and b correspond to the positions of 

30 nucleotide residues shown in SEQ ID NO:28, and where b is greater than or equal to a 
+ 14. 
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FEATURES OF PROTEIN ENCODED BY GENE NO: 19 

In specific embodiments, polypeptides of the invention comprise the following 
amino acid sequences: HERNCFPMWLNHSAFPPV (SEQ ID NO:229). 
5 Polynucleotides encoding these polypeptides are also encompassed by the invention. 

It has been discovered that this gene is expressed primarily in neutrophils, and 
to a lesser extent in other tissues. 

Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the tissue(s) or cell type(s) present in a biological sample and for 
10 diagnosis of the following diseases and conditions: immune and inflammatory 

disorders. Similarly, polypeptides and antibodies directed to those polypeptides are 
useful to provide immunological probes for differential identification of the tissue(s) 
or cell type(s). For a number of disorders of the above tissues or cells, particularly of 
the immune system, expression of this gene at significantly higher or lower levels 
15 may be detected in certain tissues or cell types (e.g., immune, cancerous and wounded 
tissues) or bodily fluids (e.g., lymph, serum, plasma, urine, synovial fluid or spinal 
fluid) taken from an individual having such a disorder, relative to the standard gene 
expression level, i.e., the expression level in healthy tissue from an individual not 
having the disorder. 

20 The tissue distribution in neutrophils indicates that the protein products of this 

clone are useful for the diagnosis and/or treatment of immune and inflammatory 
disorders. Expression of this gene product in neutrophils suggests a role in the 
regulation of the proliferation; survival; differentiation; and/or activation of 
potentially all hematopoietic cell lineages, including blood stem cells. This gene 

25 product may be involved in the regulation of cytokine production, antigen 

presentation, or other processes that may also suggest a usefulness in the treatment of 
cancer (e.g. by boosting immune responses). 

Since the gene is expressed in cells of lymphoid origin, the gene or protein, as 
well as, antibodies directed against the protein may show utility as a tumor marker 

30 and/or immunotherapy targets for the above listed tissues. Therefore it may be also 
used as an agent for immunological disorders including arthritis, asthma, immune 
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deficiency diseases such as AIDS, leukemia, rheumatoid arthritis, inflammatory 
bowel disease, sepsis, acne, and psoriasis. In addition, this gene product may have 
commercial utility in the expansion of stem cells and committed progenitors of 
various blood lineages, and in the differentiation and/or proliferation of various cell 
5 types. Expression of this gene product in neutrophils also strongly suggests a role for 
this protein in immune function and immune surveillance. Protein, as well as, 
antibodies directed against the protein may show utility as a tumor marker and/or 
immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 

10 available and accessible through sequence databases. Some of these sequences are 

related to SEQ ID NO: 29 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence 
would be cumbersome. Accordingly, preferably excluded from the present invention 

15 are one or more polynucleotides comprising a nucleotide sequence described by the 
general formula of a-b, where a is any integer between 1 to 734 of SEQ ID NO:29, b 
is an integer of 15 to 748, where both a and b correspond to the positions of 
nucleotide residues shown in SEQ ID NO: 29, and where b is greater than or equal to a 
+ 14. 

20 

FEATURES OF PROTEIN ENCODED BY GENE NO: 20 

In specific embodiments, polypeptides of the invention comprise the following 
amino acid sequences: GWTRENDHRALSKAGIGSAEIQPSNLRVGSAKDLGKPW 
AGKLLLLSSCLLFFSLGVLYRGQMLAPPLQEDWKGGVKDSDLIDDSSASPIPP 
25 SYLEYKAALYPFSEHKSVRNATDSLTFFLVTDHFLDNQDSQ (SEQ ID 

NO:230), GWTRENDHRALSKAGIGSAEIQPSNLRVGSAKDLGKPWAGKLLLL 
(SEQ ID NO:231), 

SSCLLFFSLGVLYRGQMLAPPLQEDWKGGVKDSDLIDDSSASPIPP (SEQ ID 
NO:232), and/or S YLEYKAALYPFSEHKSVRNATDSLTFFLVTDHFL DNQDSQ 
30 (SEQ ID NO:233). Polynucleotides encoding these polypeptides are also 
encompassed by the invention. 
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It has been discovered that this gene is expressed primarily in ovarian cancer 
tissue, and to a lesser extent in other tissues. 

Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the tissue(s) or cell type(s) present in a biological sample and for 
5 diagnosis of the following diseases and conditions: ovarian cancer. Similarly, 
polypeptides and antibodies directed to those polypeptides are useful to provide 
immunological probes for differential identification of the tissue(s) or cell type(s). For 
a number of disorders of the above tissues or cells, particularly of the ovaries, 
expression of this gene at significantly higher or lower levels may be detected in 
10 certain tissues or cell types (e.g., reproductive, cancerous and wounded tissues) or 
bodily fluids (e.g., lymph, serum, plasma, urine, synovial fluid or spinal fluid) taken 
from an individual having such a disorder, relative to the standard gene expression 
level, i.e., the expression level in healthy tissue from an individual not having the 
disorder. 

15 Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 

127 as residues: Thr-20 to Gly-27, Gly-32 to Phe-41. 

The tissue distribution in ovarian cancer tissue indicates that the protein 
products of this clone are useful for the diagnosis and/or treatment of ovarian cancer, 
as well as cancers of other tissues where expression has been observed. Protein, as 

20 well as, antibodies directed against the protein may show utility as a tumor marker 
and/or immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO: 30 and may have been publicly available prior to conception of 

25 the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence 
would be cumbersome. Accordingly, preferably excluded from the present invention 
are one or more polynucleotides comprising a nucleotide sequence described by the 
general formula of a-b, where a is any integer between 1 to 764 of SEQ ID NO:30, b 

30 is an integer of 15 to 778, where both a and b correspond to the positions of 
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nucleotide residues shown in SEQ ID NO:30, and where b is greater than or equal to a 
+ 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 21 

When tested against U937 Myeloid cell lines, supernatants removed from cells 
containing this gene activated the GAS assay. Thus, it is likely that this gene activates 
myeloid cells, and to a lesser extent other cells, through the Jak-STAT signal 
transduction pathway. The gamma activating sequence (GAS) is a promoter element 
found upstream of many genes which are involved in the Jak-STAT pathway. The 
Jak-STAT pathway is a large, signal transduction pathway involved in the 
differentiation and proliferation of cells. Therefore, activation of the Jak-STAT 
pathway, reflected by the binding of the GAS element, can be used to indicate 
proteins involved in the proliferation and differentiation of cells. 

In specific embodiments, polypeptides of the invention comprise the following 
amino acid sequences: LKFHQESLSGD (SEQ ID NO:234). Polynucleotides 
encoding these polypeptides are also encompassed by the invention. 

It has been discovered that this gene is expressed primarily in fast-growing 
tissues such as immune/hematopoietic tissues, early developmental stage human 
tissues, and tumor tissues, and to a lesser extent in some other tissues. 

Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the tissue(s) or cell type(s) present in a biological sample and for 
diagnosis of the following diseases and conditions: growth disorders, immune and 
inflammatory diseases, and tumorigenesis. Similarly, polypeptides and antibodies 
directed to those polypeptides are useful to provide immunological probes for 
differential identification -of the tissue(s) or cell type(s). For a number of disorders of 
the above tissues or cells, particularly of the immune/hematopoietic system, 
expression of this gene at significantly higher or lower levels may be detected in 
certain tissues or cell types (e.g., immune, cancerous and wounded tissues) or bodily 
fluids (e.g., lymph, serum, plasma, urine, synovial fluid or spinal fluid) taken from an 
individual having such a disorder, relative to the standard gene expression level, i.e., 
the expression level in healthy tissue from an individual not having the disorder. 
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Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 
128 as residues: Glu-60 to Arg-65. 

The tissue distribution in immune tissues, in conjunction with the biological 
activity data, indicates that the protein products of this clone are useful for the 
5 diagnosis and/or treatment of growth disorders, immune and inflammatory diseases, 
and tumorigenesis. Furthermore, expression within embryonic tissue and other 
cellular sources marked by proliferating cells suggests that this protein may play a 
role in the regulation of cellular division, and may show utility in the diagnosis and 
treatment of cancer and other proliferative disorders. Similarly, embryonic 

10 development also involves decisions involving cell differentiation and/or apoptosis in 
pattern formation. Thus, this protein may also be involved in apoptosis or tissue 
differentiation and could again be useful in cancer therapy. Protein, as well as, 
antibodies directed against the protein may show utility as a tumor marker and/or 
immunotherapy targets for the above listed tissues. 

15 Many polynucleotide sequences, such as EST sequences, are publicly 

available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:31 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence 

20 would be cumbersome. Accordingly, preferably excluded from the present invention 
are one or more polynucleotides comprising a nucleotide sequence described by the 
general formula of a-b, where a is any integer between 1 to 1310 of SEQ ID NO:31, b 
is an integer of 15 to 1324, where both a and b correspond to the positions of 
nucleotide residues shown in SEQ ID NO:31, and where b is greater than or equal to a 

25 + 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 22 

In specific embodiments, polypeptides of the invention comprise the following 
amino acid sequences: EAKSRPVTQAGVQWHDLGSLQPLPP (SEQ ID NO:235). 
30 Polynucleotides encoding these polypeptides are also encompassed by the invention. 
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It has been discovered that this gene is expressed primarily in ovarian cancer 
tissue, and to a lesser extent in fetal liver/spleen and retinal tissues. 

Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the tissue(s) or cell type(s) present in a biological sample and for 
5 diagnosis of the following diseases and conditions: ovarian cancer, immune disorders, 
and retinal disorders. Similarly, polypeptides and antibodies directed to those 
polypeptides are useful to provide immunological probes for differential identification 
of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells, 
particularly of the ovaries, immune and ocular systems, expression of this gene at 

10 significantly higher or lower levels may be detected in certain tissues or cell types 

(e.g., reproductive, ovaries, retina, immune, cancerous and wounded tissues) or bodily 
fluids (e.g., lymph, serum, plasma, urine, synovial fluid or spinal fluid) taken from an 
individual having such a disorder, relative to the standard gene expression level, i.e., 
the expression level in healthy tissue from an individual not having the disorder. 

15 The tissue distribution in ovarian cancer tissue indicates that the protein 

products of this clone are useful for the diagnosis and/or treatment of ovarian cancer, 
as well as cancers of other tissues where expression has been observed. The tissue 
distribution also suggests that the protein product of this clone is useful for the 
diagnosis and treatment of a variety of immune system disorders. Expression of this 

20 gene product in fetal liver/spleen suggests a role in the regulation of the proliferation; 
survival; differentiation; and/or activation of potentially all hematopoietic cell 
lineages, including blood stem cells. This gene product may be involved in the 
regulation of cytokine production, antigen presentation, or other processes that may 
also suggest a usefulness in the treatment of cancer (e.g. by boosting immune 

25 responses). 

Since the gene is expressed in cells of lymphoid origin, the gene or protein, as 
well as, antibodies directed against the protein may show utility as a tumor marker 
and/or immunotherapy targets for the above listed tissues. Therefore it may be also 
used as an agent for immunological disorders including arthritis, asthma, immune 
30 deficiency diseases such as AIDS, leukemia, rheumatoid arthritis, inflammatory 
bowel disease, sepsis, acne, and psoriasis. In addition, this gene product may have 
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commercial utility in the expansion of stem cells and committed progenitors of 
various blood lineages, and in the differentiation and/or proliferation of various cell 
types. Alternatively, the tissue distribution in retinal tissue suggests that the protein 
product of this clone is useful for the treatment and/or detection of eye disorders 
5 including blindness, color blindness, impaired vision, short and long sightedness, 
retinitis pigmentosa, retinitis proliferans, and retinoblastoma, retinochoroiditis, 
retinopathy and retinoschisis. Protein, as well as, antibodies directed against the 
protein may show utility as a tumor marker and/or immunotherapy targets for the 
above listed tissues. 

10 Many polynucleotide sequences, such as EST sequences, are publicly 

available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:32 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence 

15 would be cumbersome. Accordingly, preferably excluded from the present invention 
are one or more polynucleotides comprising a nucleotide sequence described by the 
general formula of a-b, where a is any integer between 1 to 725 of SEQ ID NO:32, b 
is an integer of 15 to 739, where both a and b correspond to the positions of 
nucleotide residues shown in SEQ ID NO:32, and where b is greater than or equal to a 
20 + 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 23 

The translation product of this gene shares sequence homology with a C. 
elegans protein of unknown function (See Genbank Accession No.: 

25 gnllPIDIe 134801 7). When tested against fibroblast cell lines, supernatants removed 
from cells containing this gene activated the EGR1 assay. Thus, it is likely that this 
gene activates fibroblast cells through a signal transduction pathway. Early growth 
response 1 (EGR1) is a promoter associated with certain genes that induces various 
tissues and cell types upon activation, leading the cells to undergo differentiation and 

30 proliferation. The gene encoding the disclosed cDNA is thought to reside on 
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chromosome 17. Accordingly, polynucleotides related to this invention are useful as a 
marker in linkage analysis for chromosome 17. 

In specific embodiments, polypeptides of the invention comprise the following 
amino acid sequences: EAKSRPVTQAGVQWHDLGSLQPLPP (SEQ ID NO:236), 
and/or ALVLVCRQRYCRPRDLLQRYDSKPIVDLIGAMETQSEPSELELDDVVIT 
NPHffiAILENEDWIEDASGLMSHCIAILKICHTLTEKI.VAMTMGSGAKMKTSA 
SVSDIIVVAKRISPRVDDVVKSMYPPLDPKLLDAR (SEQ ID NO:237). 
Polynucleotides encoding these polypeptides are also encompassed by the invention. 

It has been discovered that this gene is expressed primarily in fast growing 
tissues such as early development stage human tissues, immune/hematopoietic 
tissues, melanocytes, and tumor tissues, and to a lesser extent in some other tissues. 

Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the tissue(s) or cell type(s) present in a biological sample and for 
diagnosis of the following diseases and conditions: growth disorders, immune and 
inflammatory disoders, skin and connective tissue disorders, and tumorigenesis. 
Similarly, polypeptides and antibodies directed to those polypeptides are useful to 
provide immunological probes for differential identification of the tissue(s) or cell 
type(s). For a number of disorders of the above tissues or cells, particularly of the fast 
growing tissues such as early development stage human tissues, 
immune/hematopoietic tissues, skin and connective tissue, and tumor tissues, 
expression of this gene at significantly higher or lower levels may be detected in 
certain tissues or cell types (e.g., musculo-skeletal, skin, immune, developing, 
cancerous and wounded tissues) or bodily fluids (e.g., lymph, serum, plasma, urine, 
synovial fluid or spinal fluid) taken from an individual having such a disorder, 
relative to the standard gene expression level, i.e., the expression level in healthy 
tissue from an individual not having the disorder. 

Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 
130 as residues: Pro-34 to Ser-43, Glu-54 to Ser-60. 

The tissue distribution suggests that the protein product of this clone is useful 
for the diagnosis and/or treatment of growth disorders, immune and inflammatory 
disorders, and tumorigenesis. Alternatively, the tissue distribution in melanocytes, in 
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conjunction with the observed biological activity data, suggests that the protein 
product of this clone is useful for the treatment, diagnosis, and/or prevention of 
various skin disorders including congenital disorders (i.e. nevi, moles, freckles, 
Mongolian spots, hemangiomas, port-wine syndrome), integumentary tumors (i.e. 
5 keratoses, Bowen's disease, basal cell carcinoma, squamous cell carcinoma, 

malignant melanoma, Paget 's disease, mycosis fungoides, and Kaposi's sarcoma), 
injuries and inflammation of the skin (i.e.wounds, rashes, prickly heat disorder, 
psoriasis, dermatitis), atherosclerosis, uticaria, eczema, photosensitivity, autoimmune 
disorders (i.e. lupus erythematosus, vitiligo, dermatomyositis, morphea, scleroderma, 
10 pemphigoid, and pemphigus), keloids, striae, erythema, petechiae, purpura, and 
xanthelasma. 

Moreover, such disorders may predispose increased susceptibility to viral and 
bacterial infections of the skin (i.e. cold sores, warts, chickenpox, molluscum 
contagiosum, herpes zoster, boils, cellulitis, erysipelas, impetigo, tinea, althletes foot, 
15 and ringworm). Protein, as well as, antibodies directed against the protein may show 
utility as a tumor marker and immunotherapy targets for the above listed tumors and 
tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 

20 related to SEQ ID NO:33 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence 
would be cumbersome. Accordingly, preferably excluded from the present invention 
are one or more polynucleotides comprising a nucleotide sequence described by the 

25 general formula of a-b, where a is any integer between 1 to 1448 of SEQ ID NO: 33, b 
is an integer of 15 to 1462, where both a and b correspond to the positions of 
nucleotide residues shown in SEQ ID NO:33, and where b is greater than or equal to a 
+ 14. 
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FEATURES OF PROTEIN ENCODED BY GENE NO: 24 

When tested against U937 Myeloid cell lines, supernatants removed from cells 
containing this gene activated the GAS assay. Thus, it is likely that this gene activates 
myeloid cells, and to a lesser extent other cells, through the Jak-STAT signal 
5 transduction pathway. The gamma activating sequence (GAS) is a promoter element 
found upstream of many genes which are involved in the Jak-STAT pathway. The 
Jak-STAT pathway is a large, signal transduction pathway involved in the 
differentiation and proliferation of cells. Therefore, activation of the Jak-STAT 
pathway, reflected by the binding of the GAS element, can be used to indicate 
10 proteins involved in the proliferation and differentiation of cells 

In specific embodiments, polypeptides of the invention comprise the following 
amino acid sequences: 

DVESRGPSARCLPVVPGSLLPGLEPATKLMPGGLAPGHG 

APVRELLLPLLSQPTLGSLWDSLRHCSLLCNPLSCVPALEAPPSLVSLGCSGGC 
1 5 PRLSLAGS ASPFPFLTALLSLLNTLAQIHKGLCGQLA AILAAPGLQN YFLQCV A 
PGAAPHLTPFSAWALRHEYHLQYLALALAQKAAALQPLPATHAALYHGMAL 
ALLSRLLPGSEYLTHELLLSCVFRLEFLPERTSGGPEAADFSDQLSLGSSRVPR 
CGQGTLLAQACQDLPSIRNCYLTHCSPARASLLASQALHRGELQRVPTLLLP 
MPTEPLLPTDWPFLH (SEQ ID N 0:238), 

20 DVESRGPSARCLPVVPGSLLPGLEPATKLM PGGLAPGHGAPVRE (SEQ ID 
NO:239), LLLPLLSQPTLGSLWDSLRHCSLLCNP LSCVPALEAPPSLVSLGC 
(SEQ ID NO:240), S GGCPRLS L AGS AS PFPFLT ALL 
SLLNTLAQIHKGLCGQLAAILA (SEQ ID NO:241), APGLQNYFLQCVAPGAAP 
HLTPFSAWALRHEYHLQYLALALAQK (SEQ ID NO:242), AAALQPLPATHAA 
25 LYHGMALALLSRLLPGSEYLTHELLLSCVFR (SEQ ID NO:243), LEFLPERTSG 
GPEAADFSDQLSLGSSRVPRCGQGTLLAQACQDL (SEQ ID NO:244), and/or 
PSIRNCYLTHCSPARASLLASQALHRGELQRVPTLLLPMPTEPLLPTDWPFLH 
(SEQ ID NO:245). Polynucleotides encoding these polypeptides are also 
encompassed by the invention. 
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It has been discovered that this gene is expressed primarily in hematopoietic 
tissues and fetal heart tissue, and to a lesser extent in brain and gall bladder tissues, 
and some other tissues. 

Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the tissue(s) or cell type(s) present in a biological sample and for 
diagnosis of the following diseases and conditions: immune and inflammatory 
disorders, cardiovascular disorders, and growth disorders. Similarly, polypeptides and 
antibodies directed to those polypeptides are useful to provide immunological probes 
for differential identification of the tissue(s) or cell type(s). For a number of disorders 
of the above tissues or cells, particularly of the hematopoietic and vascular systems, 
expression of this gene at significantly higher or lower levels may be detected in 
certain tissues or cell types (e.g., vascular, immune, cancerous and wounded tissues) 
or bodily fluids (e.g., lymph, serum, plasma, urine, synovial fluid or spinal fluid) 
taken from an individual having such a disorder, relative to the standard gene 
expression level, i.e., the expression level in healthy tissue from an individual not 
having the disorder. 

Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 
13 1 as residues: Tyr-88 to Trp-102, Asp-105 to Ser-1 10. 

The tissue distribution in hematopoietic tissues, in conjunction with the 
observed biological activity data, indicates that the protein products of this clone are 
useful for the diagnosis and/or treatment of immune and inflammatory disorders and 
growth disorders. Alternatively, the tissue distribution in fetal heart tissue indicates 
that the protein product of this gene is useful for the diagnosis and treatment of 
conditions and pathologies of the cardiovascular system, such as heart disease, 
restenosis, atherosclerosis, stoke, angina, thrombosis, and wound healing. Protein, as 
well as, antibodies directed against the protein may show utility as a tumor marker 
and/or immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO: 34 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
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excluded from the scope of the present invention. To list every related sequence 
would be cumbersome. Accordingly, preferably excluded from the present invention 
are one or more polynucleotides comprising a nucleotide sequence described by the 
general formula of a-b, where a is any integer between 1 to 2801 of SEQ ID NO:34, b 
5 is an integer of 15 to 2815, where both a and b correspond to the positions of 

nucleotide residues shown in SEQ ID NO: 34, and where b is greater than or equal to a 
+ 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 25 

10 In specific embodiments, polypeptides of the invention comprise the following 

amino acid sequences: VGSVLGAFLTFPGLRLAQTHRDALT (SEQ ID NO:246). 

Polynucleotides encoding these polypeptides are also encompassed by the invention. 

The gene encoding the disclosed cDNA is thought to reside on chromosome 19. 

Accordingly, polynucleotides related to this invention are useful as a marker in 
15 linkage analysis for chromosome 19. 

It has been discovered that this gene is expressed primarily in human pituitary 

tissue. 

Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the tissue(s) or cell type(s) present in a biological sample and for 

20 diagnosis of the following diseases and conditions: hyperpituitarism and 

hypopituitarism. Similarly, polypeptides and antibodies directed to those polypeptides 
are useful to provide immunological probes for differential identification of the 
tissue(s) or cell type(s). For a number of disorders of the above tissues or cells, 
particularly of the endocrine system, expression of this gene at significantly higher or 

25 lower levels may be detected in certain tissues or cell types (e.g., endocrine, 

cancerous and wounded tissues) or bodily fluids (e.g., lymph, serum, plasma, urine, 
synovial fluid or spinal fluid) taken from an individual having such a disorder, 
relative to the standard gene expression level, i.e., the expression level in healthy 
tissue from an individual not having the disorder. This gene is found on the short arm 

30 of chromosome 19 and, therefore, is useful as a chromosome marker. 
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Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 
132 as residues: Met-1 to Pro-6, Gln-89 to Ala-94, Pro-161 to Cys-173. 

The tissue distribution in pituitary tissue indicates that the protein products of 
this clone are useful for the diagnosis and/or treatment of pituitary disorders. More 
generally, the tissue distribution in pituitary tissue suggests that the protein product of 
this clone is useful for the detection, treatment, and/or prevention of various 
endocrine disorders and cancers, particularly Addison's disease, Cushing's 
Syndrome, and disorders and/or cancers of the pancrease (e.g. diabetes mellitus), 
adrenal cortex, ovaries, pituitary (e.g., hyper-, hypopituitarism), thyroid (e.g. hyper-, 
hypothyroidism), parathyroid (e.g. hyper-, hypoparathyroidism) , hypothalamus, and 
testes. Protein, as well as, antibodies directed against the protein may show utility as a 
tumor marker and/or immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:35 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence 
would be cumbersome. Accordingly, preferably excluded from the present invention 
are one or more polynucleotides comprising a nucleotide sequence described by the 
general formula of a-b, where a is any integer between 1 to 1064 of SEQ ID NO:35, b 
is an integer of 15 to 1078, where both a and b correspond to the positions of 
nucleotide residues shown in SEQ ID NO: 35, and where b is greater than or equal to a 
+ 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 26 

It has been discovered that this gene is expressed highly and specifically in 
placental and bone marrow cDNA libraries, and to a lesser extent in T-cells. 

Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the tissue(s) or cell type(s) present in a biological sample and for 
diagnosis of the following diseases and conditions: immune, developmental and 
reproductive disorders. Similarly, polypeptides and antibodies directed to those 
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polypeptides are useful to provide immunological probes for differential identification 
of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells, 
particularly of the immune and developing systems, expression of this gene at 
significantly higher or lower levels may be detected in certain tissues or cell types 
5 (e.g., immune, developmental, reproductive, cancerous and wounded tissues) or 

bodily fluids (e.g., lymph, serum, plasma, urine, synovial fluid or spinal fluid) taken 
from an individual having such a disorder, relative to the standard gene expression 
level, i.e., the expression level in healthy tissue from an individual not having the 
disorder. 

10 The tissue distribution in bone marrow and placental tissue indicates that the 

protein products of this clone are useful for the diagnosis and/or treatment of immune 
and reproductive disorders. The tissue distribution in bone marrow suggests that the 
protein product of this clone is useful for the treatment and diagnosis of 
hematopoietic related disorders such as anemia, pancytopenia, leukopenia, 

15 thrombocytopenia or leukemia. The uses include bone marrow cell ex vivo culture, 
bone marrow transplantation, bone marrow reconstitution, radiotherapy or 
chemotherapy of neoplasia. The gene product may also be involved in lymphopoiesis, 
therefore, it can be used in immune disorders such as infection, inflammation, allergy, 
immunodeficiency etc. In addition, this gene product may have commercial utility in 

20 the expansion of stem cells and committed progenitors of various blood lineages, and 
in the differentiation and/or proliferation of various cell types. 

Alternatively, the tissue distribution in placental tissue suggests that the 
protein product of this clone is useful for the diagnosis and/or treatment of disorders 
of the placenta. Specific expression within the placenta suggests that this gene 

25 product may play a role in the proper establishment and maintenance of placental 
function. Alternately, this gene product may be produced by the placenta and then 
transported to the embryo, where it may play a crucial role in the development and/or 
survival of the developing embryo or fetus. 

Expression of this gene product in a vascular-rich tissue such as the placenta 

30 also suggests that this gene product may be produced more generally in endothelial 

cells or within the circulation. In such instances, it may play more generalized roles in 
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vascular function, such as in angiogenesis. It may also be produced in the vasculature 
and have effects on other cells within the circulation, such as hematopoietic cells. It 
may serve to promote the proliferation, survival, activation, and/or differentiation of 
hematopoietic cells, as well as other cells throughout the body. Protein, as well as, 
5 antibodies directed against the protein may show utility as a tumor marker and/or 
immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:36 and may have been publicly available prior to conception of 

10 the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence 
would be cumbersome. Accordingly, preferably excluded from the present invention 
are one or more polynucleotides comprising a nucleotide sequence described by the 
general formula of a-b, where a is any integer between 1 to 1203 of SEQ ID NO:36, b 

15 is an integer of 15 to 1217, where both a and b correspond to the positions of 

nucleotide residues shown in SEQ ID NO:36, and where b is greater than or equal to a 
+ 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 27 

20 In specific embodiments, polypeptides of the invention comprise the following 

amino acid sequences: 

LECTDTIMVHCSLKLLSPSDXSHSASQVAKTRGVHHXTQ 
LIFKVFFVXMGS HSTK YXSIRPGLLP (SEQ ID NO:247). Polynucleotides 
encoding these polypeptides are also encompassed by the invention. 

25 It has been discovered that this gene is expressed primarily in human prostate 

and smooth muscle tissues. 

Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the tissue(s) or cell type(s) present in a biological sample and for 
diagnosis of the following diseases and conditions: disorders in the prostate gland, 

30 vascular and connective tissues. Similarly, polypeptides and antibodies directed to 
those polypeptides are useful to provide immunological probes for differential 
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identification of the tissue(s) or cell type(s). For a number of disorders of the above 
tissues or cells, particularly of the male reproductive and urinary system and vascular 
system, expression of this gene at significantly higher or lower levels may be detected 
in certain tissues or cell types (e.g., reproductive, vascular, cancerous and wounded 
5 tissues) or bodily fluids (e.g., lymph, serum, plasma, urine, synovial fluid or spinal 
fluid) taken from an individual having such a disorder, relative to the standard gene 
expression level, i.e., the expression level in healthy tissue from an individual not 
having the disorder. 

The tissue distribution in prostate and smooth muscle tissues indicates that the 

10 protein products of this clone are useful for the diagnosis and/or treatment of prostate 
gland, vascular and connective tissue disorders. The tissue distribution in smooth 
muscle tissue indicates that the protein product of this gene is useful for the diagnosis 
and treatment of conditions and pathologies of the cardiovascular system, such as 
heart disease, restenosis, atherosclerosis, stoke, angina, thrombosis, and wound 

15 healing. The expression in the prostate tissue may indicate the gene or its products 

can be used in the disorders of the prostate, including inflammatory disorders, such as 
chronic prostatitis, granulomatous prostatitis and malacoplakia, prostatic hyperplasia 
and prostate neoplastic disorders, including adenocarcinoma, transitional cell 
carcinomas, ductal carcinomas, squamous cell carcinomas, or as hormones or factors 

20 with systemic or reproductive functions. Protein, as well as, antibodies directed 

against the protein may show utility as a tumor marker and/or immunotherapy targets 
for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 

25 related to SEQ ID NO:37 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence 
would be cumbersome. Accordingly, preferably excluded from the present invention 
are one or more polynucleotides comprising a nucleotide sequence described by the 

30 general formula of a-b, where a is any integer between 1 to 1268 of SEQ ID NO: 37, b 
is an integer of 15 to 1282, where both a and b correspond to the positions of 
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nucleotide residues shown in SEQ ID NO:37, and where b is greater than or equal to a 
+ 14. 
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FEATURES OF PROTEIN ENCODED BY GENE NO: 28 

In specific embodiments, polypeptides of the invention comprise the following 
amino acid sequences: ESSFVPPAAHSSLC (SEQ ID NO:248). Polynucleotides 
encoding these polypeptides are also encompassed by the invention. 
5 It has been discovered that this gene is expressed primarily in human pituitary 

tissue. 

Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the tissue(s) or cell type(s) present in a biological sample and for 
diagnosis of the following diseases and conditions: hyperpituitarism and 

10 hypopituitarism. Similarly, polypeptides and antibodies directed to those polypeptides 
are useful to provide immunological probes for differential identification of the 
tissue(s) or cell type(s). For a number of disorders of the above tissues or cells, 
particularly of the endocrine system, expression of this gene at significantly higher or 
lower levels may be detected in certain tissues or cell types (e.g., endocrine, 

15 cancerous and wounded tissues) or bodily fluids (e.g., lymph, serum, plasma, urine, 
synovial fluid or spinal fluid) taken from an individual having such a disorder, 
relative to the standard gene expression level, i.e., the expression level in healthy 
tissue from an individual not having the disorder. 

The tissue distribution in pituitary tissue indicates that the protein products of 

20 this clone are useful for the diagnosis and/or treatment of pituitary gland disorders 
such as hypopituitarism and hyperpituitarism. More generally, the tissue distribution 
in pituitary tissue suggests that the protein product of this clone is useful for the 
detection, treatment, and/or prevention of various endocrine disorders and cancers, 
particularly Addison's disease, Cushing's Syndrome, and disorders and/or cancers of 

25 the pancrease (e.g. diabetes mellitus), adrenal cortex, ovaries, pituitary (e.g., hyper-, 
hypopituitarism), thyroid (e.g. hyper-, hypothyroidism), parathyroid (e.g. hyper-, 
hypoparathyroidism) , hypothalamus, and testes. Protein, as well as, antibodies 
directed against the protein may show utility as a tumor marker and/or 
immunotherapy targets for the above listed tissues. 

30 Many polynucleotide sequences, such as EST sequences, are publicly 

available and accessible through sequence databases. Some of these sequences are 
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related to SEQ ED NO:38 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence 
would be cumbersome. Accordingly, preferably excluded from the present invention 
are one or more polynucleotides comprising a nucleotide sequence described by the 
general formula of a-b, where a is any integer between 1 to 545 of SEQ ID NO:38, b 
is an integer of 15 to 559, where both a and b correspond to the positions of 
nucleotide residues shown in SEQ ID NO:38, and where b is greater than or equal to a 
+ 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 29 

In specific embodiments, polypeptides of the invention comprise the following 
amino acid sequences: 

LLPGQQEATQCVEAGAGEGALTPMCPCRQEQFVDLYKEF 
EPSLVNSTVYIMAMAIQMAPFAINYKVRPGPCXNIHCLPTQPHPMKPSVPHPH 
RARPSWRACPRTSPWCGVWQFHSWPSLACSSAPRPTSTASLASWTSLWSSS 
WSLPRSCSWTSAWRSWPTASCSSSWGPRS (SEQ ID NO:249), 
LLPGQQEATQCV EAGAGEGALTPMCPCRQEQFVDLYKEFEPSLVN (SEQ ID 
NO:250), STVYIMAMAIQMAPFAINYKVRPGPCXNIHCLPTQPHPMKPSVP 
(SEQ ID NO:251), 

HPHRARPSWRACPRTSPWCGVWQFHSWPSLACSSAPRPTSTA (SEQ ID 
NO:252), and/or SLASWTSLWSSSWSLPRSCSWTSAWRSWPTASCSSSWG PRS 
(SEQ ID NO:253). Polynucleotides encoding these polypeptides are also 
encompassed by the invention. 

It has been discovered that this gene is expressed primarily in human pituitary 
and breast tissues, and to a lesser extent in endometrial and ovarian cancer tissues. 

Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the tissue(s) or cell type(s) present in a biological sample and for 
diagnosis of the following diseases and conditions: hyperpituitarism and 
hypopituitarism, and cancers of the female reproductive system. Similarly, 
polypeptides and antibodies directed to those polypeptides are useful to provide 
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immunological probes for differential identification of the tissue(s) or cell type(s). For 
a number of disorders of the above tissues or cells, particularly of the endocrine and 
reproductive systems, expression of this gene at significantly higher or lower levels 
may be detected in certain tissues or cell types (e.g., endocrine, reproductive, 
5 cancerous and wounded tissues) or bodily fluids (e.g., lymph, serum, plasma, urine, 
synovial fluid or spinal fluid) taken from an individual having such a disorder, 
relative to the standard gene expression level, i.e., the expression level in healthy 
tissue from an individual not having the disorder. 

Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 

10 136 as residues: Ser-3 to Lys-8. 

The tissue distribution in pituitary tissue indicates that the protein products of 
this clone are useful for the diagnosis and/or treatment of disorders in the pituitary 
gland. More generally, the tissue distribution in pituitary tissue suggests that the 
protein product of this clone is useful for the detection, treatment, and/or prevention 

15 of various endocrine disorders and cancers, particularly Addison's disease, Cushing's 
Syndrome, and disorders and/or cancers of the pancrease (e.g. diabetes mellitus), 
adrenal cortex, ovaries, pituitary (e.g., hyper-, hypopituitarism), thyroid (e.g. hyper-, 
hypothyroidism), parathyroid (e.g. hyper-, hypoparathyroidism) , hypothalamus, and 
testes. Alternatively, the tissue distribution in breast tissue and cancerous tissues of 

20 the endometrium and ovaries suggests that the translation product of this gene is 
useful for the detection and/or treatment of disorders and cancers of the female 
reproductive system, as well as cancers of other tissues where expression has been 
observed. Protein, as well as, antibodies directed against the protein may show utility 
as a tumor marker and/or immunotherapy targets for the above listed tissues. 

25 Many polynucleotide sequences, such as EST sequences, are publicly 

available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:39 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence 

30 would be cumbersome. Accordingly, preferably excluded from the present invention 
are one or more polynucleotides comprising a nucleotide sequence described by the 
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general formula of a-b, where a is any integer between 1 to 789 of SEQ ID NO:39, b 
is an integer of 15 to 803, where both a and b correspond to the positions of 
nucleotide residues shown in SEQ ID NO:39, and where b is greater than or equal to a 
+ 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 30 

In specific embodiments, polypeptides of the invention comprise the following 
amino acid sequences: TRNILSFIKCVIHNFWIPKESNEITIIINPYRETVCFSVEP 
VKKIFNY (SEQ ID NO:254). Polynucleotides encoding these polypeptides are also 
encompassed by the invention. 

It has been discovered that this gene is expressed primarily in human synovial 
sarcoma tissue. 

Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the tissue(s) or cell type(s) present in a biological sample. Similarly, 
polypeptides and antibodies directed to those polypeptides are useful to provide 
immunological probes for differential identification of the tissue(s) or cell type(s). For 
a number of disorders of the above tissues or cells, particularly of the skeletal system, 
expression of this gene at significantly higher or lower levels may be detected in 
certain tissues or cell types (e.g., skeletal, connective, cancerous and wounded tissues) 
or bodily fluids (e.g., lymph, serum, plasma, urine, synovial fluid or spinal fluid) 
taken from an individual having such a disorder, relative to the standard gene 
expression level, i.e., the expression level in healthy tissue from an individual not 
having the disorder. 

Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 
137 as residues: Thr-29 to Pro-34. 

The tissue distribution in synovial sarcoma tissue indicates that the protein 
products of this clone are useful for the diagnosis and/or treatment of diseases of the 
synovium. In addition, the 

Expression of this gene product in synovium suggests a role in the detection 
and treatment of disorders and conditions affecting the skeletal system, in particular 
osteoporosis as well as disorders afflicting connective tissues (e.g. arthritis, trauma, 
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tendonitis, chrondomalacia and inflammation), such as in the diagnosis or treatment 
of various autoimmune disorders such as rheumatoid arthritis, lupus, scleroderma, and 
dermatomyositis as well as dwarfism, spinal deformation, and specific joint 
abnormalities as well as chondrodysplasias (ie. spondyloepiphyseal dysplasia 
5 congenita, familial arthritis, Atelosteogenesis type II, metaphyseal chondrodysplasia 
type Schmid). Protein, as well as, antibodies directed against the protein may show 
utility as a tumor marker and/or immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 

10 related to SEQ ID NO:40 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence 
would be cumbersome. Accordingly, preferably excluded from the present invention 
are one or more polynucleotides comprising a nucleotide sequence described by the 

15 general formula of a-b, where a is any integer between 1 to 1496 of SEQ ID NO:40, b 
is an integer of 15 to 1510, where both a and b correspond to the positions of 
nucleotide residues shown in SEQ ID NO:40, and where b is greater than or equal to "a 
+ 14. 



20 FEATURES OF PROTEIN ENCODED BY GENE NO: 31 

In specific embodiments, polypeptides of the invention comprise the following 
amino acid sequences: LVVLFASSNSRYLKYFFLVPLILGSAW (SEQ ID NO:255). 
Polynucleotides encoding these polypeptides are also encompassed by the invention. 

It has been discovered that this gene is expressed primarily in human 
25 rhabdomyosarcoma and fetal liver/spleen tissues. 

Therefore, nucleic acids of the invention are useful as reagents for differentia] 
identification of the tissue(s) or cell type(s) present in a biological sample and for 
diagnosis of the following diseases and conditions: malignant neoplasms and 
hematopoiesis. Similarly, polypeptides and antibodies directed to those polypeptides 
30 are useful to provide immunological probes for differential identification of the 
tissue(s) or cell type(s). For a number of disorders of the above tissues or cells. 
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particularly of the skeletal and immune system, expression of this gene at 
significantly higher or lower levels may be detected in certain tissues or cell types 
(e.g., musculoskeletal, immune, cancerous and wounded tissues) or bodily fluids 
(e.g., lymph, serum, plasma, urine, synovial fluid or spinal fluid) taken from an 
5 individual having such a disorder, relative to the standard gene expression level, i.e., 
the expression level in healthy tissue from an individual not having the disorder. 

Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 
138 as residues: Gly-29 to Thr-35. 

The tissue distribution in rhabdomyosarcoma and fetal liver/spleen tissues 
10 indicates that the protein products of this clone are useful for diagnosis and treatment 
of skeletal and immune disorders. The expression in rhabdomyosarcoma tissue 
suggests that the protein product of this clone is useful for the detection, treatment, 
and/or prevention of various muscle disorders, such as muscular dystrophy, 
cardiomyopathy, fibroids, myomas, and rhabdomyosarcomas. Alternatively, 
15 Expression of this gene product in fetal liver/spleen tissue suggests a role in 

the regulation of the proliferation; survival; differentiation; and/or activation of 
potentially all hematopoietic cell lineages, including blood stem cells. This gene 
product may be involved in the regulation of cytokine production, antigen 
presentation, or other processes that may also suggest a usefulness in the treatment of 
20 cancer (e.g. by boosting immune responses). 

Since the gene is expressed in cells of lymphoid origin, the gene or protein, as 
well as, antibodies directed against the protein may show utility as a tumor marker 
and/or immunotherapy targets for the above listed tissues. Therefore it may be also 
used as an agent for immunological disorders including arthritis, asthma, immune 
25 deficiency diseases such as AIDS, leukemia, rheumatoid arthritis, inflammatory 
bowel disease, sepsis, acne, and psoriasis. In addition, this gene product may have 
commercial utility in the expansion of stem cells and committed progenitors of 
various blood lineages, and in the differentiation and/or proliferation of various cell 
types. Protein, as well as, antibodies directed against the protein may show utility as a 
30 tumor marker and/or immunotherapy targets for the above listed tissues. 
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Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:41 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
5 excluded from the scope of the present invention. To list every related sequence 

would be cumbersome. Accordingly, preferably excluded from the present invention 
are one or more polynucleotides comprising a nucleotide sequence described by the 
general formula of a-b, where a is any integer between 1 to 108 1 of SEQ ID NO:41 , b 
is an integer of 15 to 1095, where both a and b correspond to the positions of 
10 nucleotide residues shown in SEQ ID NO:41, and where b is greater than or equal to a 
+ 14. 



FEATURES OF PROTEIN ENCODED BY GENE NO: 32 

It has been discovered that this gene is expressed primarily in fibrosarcoma 

15 tissue. 

Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the tissue(s) or cell type(s) present in a biological sample and for 
diagnosis of the following diseases and conditions: fibrosarcoma. Similarly, 
polypeptides and antibodies directed to those polypeptides are useful to provide 

20 immunological probes for differential identification of the tissue(s) or cell type(s). For 
a number of disorders of the above tissues or cells, particularly of the connective 
tissue system, expression of this gene at significantly higher or lower levels may be 
detected in certain tissues or cell types (e.g., musculoskeletal, cancerous and 
wounded tissues) or bodily fluids (e.g., lymph, serum, plasma, urine, synovial fluid or 

25 spinal fluid) taken from an individual having such a disorder, relative to the standard 
gene expression level, i.e., the expression level in healthy tissue from an individual 
not having the disorder. 

Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 
139 as residues: Ser-34 to Gln-40, Gly-42 to Glu-48, Tyr-56 to Leu-62. 

30 The tissue distribution in only fibrosarcoma tissue suggests that the protein 

product of this clone is useful for the treatment, diagnosis and/or prognosis of 
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fibrosarcoma^ or other diorders related with fibrous tissue including fibroma, 
fibromatosis, fibromyoma, fibromyositis, fibrosis and fibrositis. Likewise, the 
expression in fibrosarcoma tissue suggests that the protein product of this clone is 
useful for the detection, treatment, and/or prevention of various muscle disorders, 
5 such as muscular dystrophy, cardiomyopathy, myomas, and rhabdomyosarcomas. 

Protein, as well as, antibodies directed against the protein may show utility as a tumor 
marker and/or immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 

10 related to SEQ ID NO:42 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence 
would be cumbersome. Accordingly, preferably excluded from the present invention 
are one or more polynucleotides comprising a nucleotide sequence described by the 

15 general formula of a-b, where a is any integer between 1 to 1 148 of SEQ ID NO:42, b 
is an integer of 15 to 1 162, where both a and b correspond to the positions of 
nucleotide residues shown in SEQ ID NO:42, and where b is greater than or equal to a 
+ 14. 

20 FEATURES OF PROTEIN ENCODED BY GENE NO: 33 

It has been discovered that this gene is expressed primarily in Hodgkins 
lymphoma and breast cancer tissues, and to a lesser extent in stromal cells and brain 
tissue. 

Therefore, nucleic acids of the invention are useful as reagents for differential 
25 identification of the tissue(s) or cell type(s) present in a biological sample and for 
diagnosis of the following diseases and conditions: lymphoma, breast cancer, and 
neurological disorders. Similarly, polypeptides and antibodies directed to those 
polypeptides are useful to provide immunological probes for differential identification 
of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells, 
30 particularly of the immune amd nervous systems, expression of this gene at 

significantly higher or lower levels may be detected in certain tissues or cell types 
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(e.g., immune, neural, cancerous and wounded tissues) or bodily fluids (e.g., lymph, 
serum, plasma, urine, synovial fluid or spinal fluid) taken from an individual having 
such a disorder, relative to the standard gene expression level, i.e., the expression 
level in healthy tissue from an individual not having the disorder. 
5 Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 

140 as residues: Pro-22 to Lys-29. 

The tissue distribution in Hodgkins lymphoma, brain and breast cancer tissues 
suggests a role in the treatment, diagnosis and/or prognosis of breast cancer, immune 
and hematopoietic disorders including arthritis, asthma, immunodeficiency diseases, 

10 leukemia and Hodgkin's lymphoma and neurodegenerative disease states and 

behavioral disorders such as Alzheimer's Disease, Parkinson's Disease, Huntington's 
Disease, schizophrenia, mania, dementia, paranoia, obsessive compulsive disorder 
and panic disorder. Protein, as well as, antibodies directed against the protein may 
show utility as a tumor marker and/or immunotherapy targets for the above listed 

15 tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:43 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 

20 excluded from the scope of the present invention. To list every related sequence 

would be cumbersome. Accordingly, preferably excluded from the present invention 
are one or more polynucleotides comprising a nucleotide sequence described by the 
general formula of a-b, where a is any integer between 1 to 643 of SEQ ID NO:43, b 
is an integer of 15 to 657, where both a and b correspond to the positions of 

25 nucleotide residues shown in SEQ ID NO:43, and where b is greater than or equal to a 
+ 14. 



FEATURES OF PROTEIN ENCODED BY GENE NO: 34 

In specific embodiments, polypeptides of the invention comprise the following 
30 amino acid sequences: HEWKCKQKYSEGSGNTRIGN (SEQ ID NO:256). 

Polynucleotides encoding these polypeptides are also encompassed by the invention. 
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It has been discovered that this gene is expressed primarily in chronic 
synovitis tissue, and to a lesser extent in fetal kidney and testes tissues. 

Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the tissue(s) or cell type(s) present in a biological sample and for 
5 diagnosis of the following diseases and conditions: synovitis, renal disorders and male 
infertility. Similarly, polypeptides and antibodies directed to those polypeptides are 
useful to provide immunological probes for differential identification of the tissue(s) 
or cell type(s). For a number of disorders of the above tissues or cells, particularly of 
the connective tissue system, the renal system, and the male reproductive system, 
10 expression of this gene at significantly higher or lower levels may be detected in 

certain tissues or cell types (e.g., skeletal, renal, reproductive, cancerous and wounded 
tissues) or bodily fluids (e.g., lymph, serum, plasma, urine, synovial fluid or spinal 
fluid) taken from an individual having such a disorder, relative to the standard gene 
expression level, i.e., the expression level in healthy tissue from an individual not 
1 5 having the disorder. 

Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 
141 as residues: Met-33 to Pro-39, Ser-74 to Trp-79. 

The tissue distribution of this gene in chronic synovitis, testes, and kidneys 
suggests a role in the treatment, diagnosis and prognosis of synovial membrane 
20 disorders including synovitis, renal disorders including kidney failure, renal colic, 

renal diabetes, hypertension, osteodystrophy, tubular acidosis and kidney stones; and 
and male infertility. Furthermore, the tissue distribution in testes tissue indicates that 
the protein product of this clone is useful for the treatment and/or diagnosis of 
conditions concerning proper testicular function (e.g. endocrine function, sperm 
25 maturation), as well as cancer. Therefore, this gene product is useful in the treatment 
of male infertility and/or impotence. This gene product is also useful in assays 
designed to identify binding agents, as such agents (antagonists) are useful as male 
contraceptive agents. Similarly, the protein is believed to be useful in the treatment 
and/or diagnosis of testicular cancer. The testes are also a site of active gene 
30 expression of transcripts that may be expressed, particularly at low levels, in other 
tissues of the body. Therefore, this gene product may be expressed in other specific 
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tissues or organs where it may play related functional roles in other processes, such as 
hematopoiesis, inflammation, bone formation, and kidney function, to name a few 
possible target indications. In addition, the 

Expression of this gene product in synovium suggests a role in the detection 
5 and/or treatment of disorders and conditions affecting the skeletal system, in 

particular osteoporosis as well as disorders afflicting connective tissues (e.g. arthritis, 
trauma, tendonitis, chrondomalacia and inflammation), such as in the diagnosis or 
treatment of various autoimmune disorders such as rheumatoid arthritis, lupus, 
scleroderma, and dermatomyositis as well as dwarfism, spinal deformation, and 
10 specific joint abnormalities as well as chondrodysplasias (ie. spondyloepiphyseal 
dysplasia congenita, familial arthritis, Atelosteogenesis type II, metaphyseal 
chondrodysplasia type Schmid). Protein, as well as, antibodies directed against the 
protein may show utility as a tumor marker and/or immunotherapy targets for the 
above listed tissues. 

15 Many polynucleotide sequences, such as EST sequences, are publicly 

available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:44 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence 

20 would be cumbersome. Accordingly, preferably excluded from the present invention 
are one or more polynucleotides comprising a nucleotide sequence described by the 
general formula of a-b, where a is any integer between 1 to 1 141 of SEQ ID NO:44, b 
is an integer of 15 to 1 155, where both a and b correspond to the positions of 
nucleotide residues shown in SEQ ID NO:44, and where b is greater than or equal to a 

25 + 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 35 

In specific embodiments, polypeptides of the invention comprise the following 
amino acid sequences: LLPLCFLGPRQVLEEFPSIV (SEQ ID NO:257). 
30 Polynucleotides encoding these polypeptides are also encompassed by the invention. 



BNSDOCID: <WO 9947540A1_I_> 



WO 99/47540 



PCT/US99/05804 



It has been discovered that this gene is expressed primarily in brain tissue, and 
to a lesser extent in osteoclastoma and testes tissues. 

Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the tissue(s) or cell type(s) present in a biological sample and for 
5 diagnosis of the following diseases and conditions: neurological disorders and male 
reproductive disorders. Similarly, polypeptides and antibodies directed to those 
polypeptides are useful to provide immunological probes for differential identification 
of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells, 
particularly of the nervous system and the male reproductive system, expression of 

10 this gene at significantly higher or lower levels may be detected in certain tissues or 
cell types (e.g., neural, reproductive, cancerous and wounded tissues) or bodily fluids 
(e.g., lymph, serum, plasma, urine, synovial fluid or spinal fluid) taken from an 
individual having such a disorder, relative to the standard gene expression level, i.e., 
the expression level in healthy tissue from an individual not having the disorder. 

15 The tissue distribution of this gene in brain tissue suggests a role in the 

diagnosis, prognosis and/or treatment of neurodegenerative disease states and 
behavioural disorders such as Alzheimer's Disease, Parkinson's Disease, Huntinton's 
Disease, schizophrenia, mania, dementia, paranoia, obsessive compulsive disorder 
and panic disorder. In addition, the gene or gene product may also play a role in the 

20 treatment and/or detection of developmental disorders associated with the developing 
embryo, or sexually-linked disorders. Protein, as well as, antibodies directed against 
the protein may show utility as a tumor marker and/or immunotherapy targets for the 
above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 

25 available and accessible through sequence databases. Some of these sequences are 

related to SEQ ID NO:45 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence 
would be cumbersome. Accordingly, preferably excluded from the present invention 

30 are one or more polynucleotides comprising a nucleotide sequence described by the 
general formula of a-b, where a is any integer between 1 to 1098 of SEQ ID NO:45, b 



BNSDOCID: <WO 9947540A1_I_> 



WO 99/47540 



67 



PCT/US99/05804 



is an integer of 15 to 1 1 12, where both a and b correspond to the positions of 
nucleotide residues shown in SEQ ID NO:45, and where b is greater than or equal to a 
+ 14. 

5 FEATURES OF PROTEIN ENCODED BY GENE NO: 36 

In specific embodiments, polypeptides of the invention comprise the following 
amino acid sequences: PTRPS KHQE AGS (SEQ ID NO:258). Polynucleotides 
encoding these polypeptides are also encompassed by the invention. The gene 
encoding the disclosed cDNA is thought to reside on chromosome 3. Accordingly, 
10 polynucleotides related to this invention are useful as a marker in linkage analysis for 
chromosome 3. 

It has been discovered that this gene is expressed primarily in adult and fetal 
heart tissue, and to a lesser extent in fetal lung and fetal liver/spleen tissues. 

Therefore, nucleic acids of the invention are useful as reagents for differential 

15 identification of the tissue(s) or cell type(s) present in a biological sample and for 
diagnosis of the following diseases and conditions: cardiovascular and immune 
disorders. Similarly, polypeptides and antibodies directed to those polypeptides are 
useful to provide immunological probes for differential identification of the tissue(s) 
or cell type(s). For a number of disorders of the above tissues or cells, particularly of 

20 the vascular and immune systems, expression of this gene at significantly higher or 
lower levels may be detected in certain tissues or cell types (e.g., vascular, immune, 
pulmonary, cancerous and wounded tissues) or bodily fluids (e.g., lymph, serum, 
plasma, urine, synovial fluid or spinal fluid) taken from an individual having such a 
disorder, relative to the standard gene expression level, i.e., the expression level in 

25 healthy tissue from an individual not having the disorder. 

Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 
143 as residues: Val-2 to Ser-14. 

The tissue distribution in heart, fetal liver and fetal spleen tissues suggests a 
role in the treatment and/or diagnosis of cardiovascular disorders including 

30 myocardial infarction, congestive heart failure, coronary failure, as well as immune 
disorders including autoimmune diseases, such as lupus, transplant rejection, allergic 
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reactions, arthritis, asthma, immunodeficiency diseases, leukemia, and AIDS. 
Furthermore, the tissue distribution in adult and fetal heart tissue indicates that the 
protein product of this gene is useful for the diagnosis and treatment of conditions and 
pathologies of the cardiovascular system, such as heart disease, restenosis, 
atherosclerosis, stoke, angina, thrombosis, and wound healing. Protein, as well as, 
antibodies directed against the protein may show utility as a tumor marker and/or 
immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:46 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence 
would be cumbersome. Accordingly, preferably excluded from the present invention 
are one or more polynucleotides comprising a nucleotide sequence described by the 
general formula of a-b, where a is any integer between 1 to 4009 of SEQ ID NO:46, b 
is an integer of 15 to 4023, where both a and b correspond to the positions of 
nucleotide residues shown in SEQ ID NO:46, and where b is greater than or equal to a 
+ 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 37 

It has been discovered that this gene is expressed primarily in testes tissues. 

Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the tissue(s) or cell type(s) present in a biological sample and for 
diagnosis of the following diseases and conditions: male infertility and reproductive 
disorders. Similarly, polypeptides and antibodies directed to those polypeptides are 
useful to provide immunological probes for differential identification of the tissue(s) 
or cell type(s). For a number of disorders of the above tissues or cells, particularly of 
the male reproductive system, expression of this gene at significantly higher or lower 
levels may be detected in certain tissues or cell types (e.g., reproductive, cancerous 
and wounded tissues) or bodily fluids (e.g., lymph, serum, plasma, urine, synovial 
fluid or spinal fluid) taken from an individual having such a disorder, relative to the 
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standard gene expression level, i.e., the expression level in healthy tissue from an 
individual not having the disorder. 

The tissue distribution in testes tissues suggests a role in the treatment and/or 
diagnosis of male infertility, and testicular disorders including cancer. Furthermore, 
5 the tissue distribution in testes tissue indicates that the protein product of this clone is 
useful for the treatment and diagnosis of conditions concerning proper testicular 
function (e.g. endocrine function, sperm maturation), as well as cancer. Therefore, 
this gene product is useful in the treatment of male infertility and/or impotence. This 
gene product is also useful in assays designed to identify binding agents, as such 

10 agents (antagonists) are useful as male contraceptive agents. Similarly, the protein is 
believed to be useful in the treatment and/or diagnosis of testicular cancer. The testes 
are also a site of active gene expression of transcripts that may be expressed, 
particularly at low levels, in other tissues of the body. Therefore, this gene product 
may be expressed in other specific tissues or organs where it may play related 

1 5 functional roles in other processes, such as hematopoiesis, inflammation, bone 

formation, and kidney function, to name a few possible target indications. Protein, as 
well as, antibodies directed against the protein may show utility as a tumor marker 
and/or immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 

20 available and accessible through sequence databases. Some of these sequences are 

related to SEQ ID NO:47 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence 
would be cumbersome. Accordingly, preferably excluded from the present invention 

25 are one or more polynucleotides comprising a nucleotide sequence described by the 
general formula of a-b, where a is any integer between 1 to 528 of SEQ ID NO:47, b 
is an integer of 15 to 542, where both a and b correspond to the positions of 
nucleotide residues shown in SEQ ID NO:47, and where b is greater than or equal to a 
+ 14. 

30 
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FEATURES OF PROTEIN ENCODED BY GENE NO: 38 

It has been discovered that this gene is expressed primarily in apoptotic T- 
cells, and to a lesser extent in brain tissue. 

Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the tissue(s) or cell type(s) present in a biological sample and for 
diagnosis of the following diseases and conditions: immune and neurological 
disorders. Similarly, polypeptides and antibodies directed to those polypeptides are 
useful to provide immunological probes for differential identification of the tissue(s) 
or cell type(s). For a number of disorders of the above tissues or cells, particularly of 
the immune and nervous systems, expression of this gene at significantly higher or 
lower levels may be detected in certain tissues or cell types (e.g., immune, neural, 
cancerous and wounded tissues) or bodily fluids (e.g., lymph, serum, plasma, urine, 
synovial fluid or spinal fluid) taken from an individual having such a disorder, 
relative to the standard gene expression level, i.e., the expression level in healthy 
tissue from an individual not having the disorder. 

Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 
145 as residues: Glu-33 to Tyr-42. 

The tissue distribution in apoptotic T-cells suggests potential roles in the 
treatment and/or diagnosis of immune disorders including of immune and 
autoimmune diseases, such as lupus, transplant rejection, allergic reactions, arthritis, 
asthma, immunodeficiency diseases, leukemia, and AIDS. Alternatively, expression 
in brain tissue suggests a role in the treatment and/or diagnosis of neurodegenerative 
disease states and behavioural disorders such as Alzheimer's Disease, Parkinson's 
Disease, Huntinton's Disease, schizophrenia, mania, dementia, paranoia, obsessive 
compulsive disorder and panic disorder. Furthermore, the tissue distribution in 
apoptotic T-cells indicates that the translation product of this gene may also be 
involved in apoptosis or tissue differentiation and could again be useful in cancer 
therapy. Protein, as well as, antibodies directed against the protein may show utility as 
a tumor marker and/or immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
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related to SEQ ID NO:48 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence 
would be cumbersome. Accordingly, preferably excluded from the present invention 
5 are one or more polynucleotides comprising a nucleotide sequence described by the 
general formula of a-b, where a is any integer between 1 to 1481 of SEQ ID NO:48, b 
is an integer of 15 to 1495, where both a and b correspond to the positions of 
nucleotide residues shown in SEQ ID NO:48, and where b is greater than or equal to a 
+ 14. 

10 

FEATURES OF PROTEIN ENCODED BY GENE NO: 39 

The translation product of this gene shares sequence homology with 
phosphomannomutase, which is thought to be important in mannose matabolism. 

It has been discovered that this gene is expressed primarily in meningioma and 
15 testis tissues. 

Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the tissue(s) or cell type(s) present in a biological sample and for 
diagnosis of the following diseases and conditions: meningioma related diseases. 
Similarly, polypeptides and antibodies directed to those polypeptides are useful to 

20 provide immunological probes for differential identification of the tissue(s) or cell 
type(s). For a number of disorders of the above tissues or cells, particularly of the 
central nervous system, expression of this gene at significantly higher or lower levels 
may be detected in certain tissues or cell types (e.g., neural, cancerous and wounded 
tissues) or bodily fluids (e.g., lymph, serum, plasma, urine, synovial fluid or spinal 

25 fluid) taken from an individual having such a disorder, relative to the standard gene 
expression level, i.e., the expression level in healthy tissue from an individual not 
having the disorder. 

Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 
146 as residues: Ser-33 to Lys-43. 

30 The tissue distribution in meningioma, and the homology to 

phosphomannomutase, suggests that the protein product of this clone is useful for the 
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diagnosis and/or intervention of meningioma related diseases. For example, the gene 
product can be used for preventing microbial infection of the meninges, for imaging 
conjugates, or as a secretory factor as a endocrine with systemic, central or peripheral 
nerve functions. Protein, as well as, antibodies directed against the protein may show 
5 utility as a tumor marker and/or immunotherapy targets for the above listed tissues. 
Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:49 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 

10 excluded from the scope of the present invention. To list every related sequence 

would be cumbersome. Accordingly, preferably excluded from the present invention 
are one or more polynucleotides comprising a nucleotide sequence described by the 
general formula of a-b, where a is any integer between 1 to 804 of SEQ ID NO:49, b 
is an integer of 15 to 818, where both a and b correspond to the positions of 

15 nucleotide residues shown in SEQ ID NO:49, and where b is greater than or equal to a 
+ 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 40 

It has been discovered that this gene is expressed primarily in tonsils, 
20 osteoclastoma and retinoic acid treated teratocarcinoma cells, and to a lesser extent in 
macrophages, female bladder, adipose tissue, myeloid progenitor cells, prostate tissue, 
and number of other tissues and organs. 

Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the tissue(s) or cell type(s) present in a biological sample and for 
25 diagnosis of the following diseases and conditions: tonsils and osteoclast related 
diseases. Similarly, polypeptides and antibodies directed to those polypeptides are 
useful to provide immunological probes for differential identification of the tissue(s) 
or cell type(s). For a number of disorders of the above tissues or cells, particularly of 
the immune and bone systems, expression of this gene at significantly higher or lower 
30 levels may be detected in certain tissues or cell types (e.g., immune, skeletal, 

cancerous and wounded tissues) or bodily fluids (e.g., lymph, serum, plasma, urine, 
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synovial fluid or spinal fluid) taken from an individual having such a disorder, 
relative to the standard gene expression level, i.e., the expression level in healthy 
tissue from an individual not having the disorder. 

Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 
5 147 as residues: Glu-55 to Arg-61, Gln-84 to Ser-92, Ser-99 to Ser-104. 

The tissue distribution in tonsils and osteoclastoma suggests that the protein 
product of this clone is useful for the diagnosis and/or intervention of diseases related 
to tonsils or osteoclasts. For example, tonsillitis, adenoids, peritonsilar abscess, 
neoplasms, or bone related disorders like rickets, abnormalities of bone growth and 

10 modelling, facture, osteonecrosis, and osteoporosis etc. Expression of this gene 

product in osteoclastoma suggests that it may play a role in the survival, proliferation, 
and/or growth of osteoclasts. Therefore, it may be useful in influencing bone mass in 
such conditions as osteoporosis. 

Alternatively, the expression of this gene product in tonsils suggests a role in 

15 the regulation of the proliferation; survival; differentiation; and/or activation of 
potentially all hematopoietic cell lineages, including blood stem cells. This gene 
product may be involved in the regulation of cytokine production, antigen 
presentation, or other processes that may also suggest a usefulness in the treatment of 
cancer (e.g. by boosting immune responses). 

20 Since the gene is expressed in cells of lymphoid origin, the gene or protein, as 

well as, antibodies directed against the protein may show utility as a tumor marker 
and/or immunotherapy targets for the above listed tissues. Therefore it may be also 
used as an agent for immunological disorders including arthritis, asthma, immune 
deficiency diseases such as AIDS, leukemia, rheumatoid arthritis, inflammatory 

25 bowel disease, sepsis, acne, and psoriasis. In addition, this gene product may have 
commercial utility in the expansion of stem cells and committed progenitors of 
various blood lineages, and in the differentiation and/or proliferation of various cell 
types. Protein, as well as, antibodies directed against the protein may show utility as a 
tumor marker and/or immunotherapy targets for the above listed tissues. 

30 Many polynucleotide sequences, such as EST sequences, are publicly 

available and accessible through sequence databases. Some of these sequences are 
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related to SEQ ID NO: 50 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence 
would be cumbersome. Accordingly, preferably excluded from the present invention 
5 are one or more polynucleotides comprising a nucleotide sequence described by the 
general formula of a-b, where a is any integer between 1 to 1697 of SEQ ID NO: 50, b 
is an integer of 15 to 171 1, where both a and b correspond to the positions of 
nucleotide residues shown in SEQ ID NO: 50, and where b is greater than or equal to a 
+ 14. 

10 

FEATURES OF PROTEIN ENCODED BY GENE NO: 41 

It has been discovered that this gene is expressed primarily in resting T-cells. 
Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the tissue(s) or cell type(s) present in a biological sample and for 

15 diagnosis of the following diseases and conditions: T-cell related disorders. Similarly, 
polypeptides and antibodies directed to those polypeptides are useful to provide 
immunological probes for differential identification of the tissue(s) or cell type(s). For 
a number of disorders of the above tissues or cells, particularly of the immune system, 
expression of this gene at significantly higher or lower levels may be detected in 

20 certain tissues or cell types (e.g., immune, cancerous and wounded tissues) or bodily 
fluids (e.g., lymph, serum, plasma, urine, synovial fluid or spinal fluid) taken from an 
individual having such a disorder, relative to the standard gene expression level, i.e., 
the expression level in healthy tissue from an individual not having the disorder. 

The tissue distribution in resting T-cells suggests that the protein product of 

25 this clone is useful for the diagnosis and/or intervention of T-cell related disorders, 
such as infection, inflammation, allergy, tissue/organ transplantation, immune 
deficiency etc. Furthermore, the expression of this gene product in T cells also 
strongly suggests a role for this protein in immune function and immune surveillance. 
Protein, as well as, antibodies directed against the protein may show utility as a tumor 

30 marker and/or immunotherapy targets for the above listed tissues. 
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Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:51 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
5 excluded from the scope of the present invention. To list every related sequence 

would be cumbersome. Accordingly, preferably excluded from the present invention 
are one or more polynucleotides comprising a nucleotide sequence described by the 
general formula of a-b, where a is any integer between 1 to 735 of SEQ ID NO:51, b 
is an integer of 15 to 749, where both a and b correspond to the positions of 
10 nucleotide residues shown in SEQ ID NO:5 1 , and where b is greater than or equal to a 
+ 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 42 

The translation product of this gene shares weak sequence homology with 

15 Human metastasis suppressor KiSS-1 fragment, which is thought to be important in 
the diagnosis, prevention, staging and/or treatment of cancers, such as melanoma (See 
Accession No. W 15789). 

In specific embodiments, polypeptides of the invention comprise the following 
amino acid sequences: GQGPAGRWVRRLPCSRRAGGERGPHWGVWAGPQM 

20 SCGLXFGP (SEQ ID NO:259), WRTQGPMVLLWVVTCPATMLTEPQNPHLIGF 
VAYSGPSHTTQPHKYWLLLDGQADPAAAEGPVKRKAASVVWWPQALRHLS 
LLVHCWEESYEMNIGCQSLWAGGLASSGNGWDLGVAFRRDTCMSSSSLHW 
KEFKYAPGSLHYFALSFVLILTEICLVSSGMGFPQEGKHFSVLGSPDCSLWGR 
DEHVPREFA (SEQ ID NO:2 6 0 ), 

25 WRTQGPMVLLWVVTCPATMLTEPQNPHLIGFVAY SGPSHTTQ (SEQ ID 
NO:261), PHKYWLLLDGQADPAAAEGPVKRKAASVVWW PQALRHLSLL 
(SEQ ID NO:262), VHCWEES YEMNIGCQSLWAGGLASSGNGW 
DLGVAFRRDTCM (SEQ ID NO:2 6 3 ), 

SSSSLHWKEFKYAPGSLHYFALSFVLILT EICLVSSGMGFPQEG (SEQ ID 

30 NO:264), and/or KHFSVLGSPDCSLWGRDEHV PREFA (SEQ ID NO:265). 
Polynucleotides encoding these polypeptides are also encompassed by the invention. 
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The gene encoding the disclosed cDNA is thought to reside on chromosome 1. 
Accordingly, polynucleotides related to this invention are useful as a marker in 
linkage analysis for chromosome 1 . 

It has been discovered that this gene is expressed primarily in tonsils, 
osteoclastoma and teratocarcinoma tissues, and to a lesser extent in female bladder, 
adipose tissue, myeloid progenitor, prostate tissue, and number of other tissues. 

Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the tissue(s) or cell type(s) present in a biological sample and for 
diagnosis of the following diseases and conditions: diseases related to tonsils and 
osteoclasts. Similarly, polypeptides and antibodies directed to those polypeptides are 
useful to provide immunological probes for differential identification of the tissue(s) 
or cell type(s). For a number of disorders of the above tissues or cells, particularly of 
the immune and bone system, expression of this gene at significantly higher or lower 
levels may be detected in certain tissues or cell types (e.g., immune, skeletal, 
cancerous and wounded tissues) or bodily fluids (e.g., lymph, serum, plasma, urine, 
synovial fluid or spinal fluid) taken from an individual having such a disorder, 
relative to the standard gene expression level, i.e., the expression level in healthy 
tissue from an individual not having the disorder. 

The tissue distribution in tonsils and osteoclastoma tissues suggests that the 
protein product of this clone is useful for the diagnosis and/or treatment of diseases 
related to tonsils and osteoclasts. For example, tonsillitis, adenoids, peritonsilar 
abscess, neoplasms, or abnormal growth and modelling of the bone, osteonecrosis, 
osteoporosis, osteodystrophy, osteoclastoma etc. Expression of this gene product in 
osteoclastoma suggests that it may play a role in the survival, proliferation, and/or 
growth of osteoclasts. Therefore, it may be useful in influencing bone mass in such 
conditions as osteoporosis. 

Moreover, the expression of this gene product in tonsils suggests a role in the 
regulation of the proliferation; survival; differentiation; and/or activation of 
potentially all hematopoietic cell lineages, including blood stem cells. This gene 
product may be involved in the regulation of cytokine production, antigen 
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presentation, or other processes that may also suggest a usefulness in the treatment of 
cancer (e.g. by boosting immune responses). 

Since the gene is expressed in cells of lymphoid origin, the gene or protein, as 
well as, antibodies directed against the protein may show utility as a tumor marker 
and/or immunotherapy targets for the above listed tissues. Therefore it may be also 
used as an agent for immunological disorders including arthritis, asthma, immune 
deficiency diseases such as AIDS, leukemia, rheumatoid arthritis, inflammatory 
bowel disease, sepsis, acne, and psoriasis. In addition, this gene product may have 
commercial utility in the expansion of stem cells and committed progenitors of 
various blood lineages, and in the differentiation and/or proliferation of various cell 
types. Protein, as well as, antibodies directed against the protein may show utility as a 
tumor marker and/or immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:52 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence 
would be cumbersome. Accordingly, preferably excluded from the present invention 
are one or more polynucleotides comprising a nucleotide sequence described by the 
general formula of a-b, where a is any integer between 1 to 1077 of SEQ ID NO:52, b 
is an integer of 15 to 1091, where both a and b correspond to the positions of 
nucleotide residues shown in SEQ ID NO:52, and where b is greater than or equal to a 
+ 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 43 

The translation product of this gene shares sequence homology with the 
Drosophila gene "maleless", which is one of four known regulatory loci required for 
increased transcription (dosage compensation) of X-linked genes (See Genbank 
Accession No.: gill57906). 

It has been discovered that this gene is expressed primarily in normal prostate 
tissue, testes tissue, whole 6-week old embryonic tissue, human colon carcinoma 
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(HCC) cell line, and cerebellum tissue, and to a lesser extent in primary breast cancer, 
activated T-cells, and many other tissues. 

Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the tissue(s) or cell type(s) present in a biological sample and for 
diagnosis of the following diseases and conditions: diseases of the prostate or colon, 
or male reproductive disorders. Similarly, polypeptides and antibodies directed to 
those polypeptides are useful to provide immunological probes for differential 
identification of the tissue(s) or cell type(s). For a number of disorders of the above 
tissues or cells, particularly of the prostate or colon carcinoma, and male reproductive 
disorders, expression of this gene at significantly higher or lower levels may be 
detected in certain tissues or cell types (e.g., colon, prostate, reproductive, cancerous 
and wounded tissues) or bodily fluids (e.g., lymph, serum, plasma, urine, synovial 
fluid or spinal fluid) taken from an individual having such a disorder, relative to the 
standard gene expression level, i.e., the expression level in healthy tissue from an 
individual not having the disorder. 

Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 
150 as residues: Val-39 to Ala-45. 

The tissue distribution in colon and prostate tissues suggests that the protein 
product of this clone is useful for the diagnosis and/or treatment of prostate disorders 
such as prostatitis, prostatic hyperplasia, prostate cancers, or human colon carcinoma, 
as well as cancers of other tissues where expression has been observed. Alternatively, 
the tissue distribution in testes tissue, in conjunction with the homology to the 
Drosophila maleless gene, suggests that the translation product of this gene is useful 
for the detection and/or treatment of disorders involving the testes or the transcription 
of X-linked genes. Furthermore, the tissue distribution indicates that the protein 
product of this clone is useful for the treatment and diagnosis of conditions 
concerning proper testicular function (e.g. endocrine function, sperm maturation), as 
well as cancer. Therefore, this gene product is useful in the treatment of male 
infertility and/or impotence. 

This gene product is also useful in assays designed to identify binding agents, 
as such agents (antagonists) are useful as male contraceptive agents. Similarly, the 
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protein is believed to be useful in the treatment and/or diagnosis of testicular cancer. 
The testes are also a site of active gene expression of transcripts that may be 
expressed, particularly at low levels, in other tissues of the body. Therefore, this gene 
product may be expressed in other specific tissues or organs where it may play related 
5 functional roles in other processes, such as hematopoiesis, inflammation, bone 

formation, and kidney function, to name a few possible target indications. Protein, as 
well as, antibodies directed against the protein may show utility as a tumor marker 
and/or immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 

10 available and accessible through sequence databases. Some of these sequences are 

related to SEQ ID NO:53 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence 
would be cumbersome. Accordingly, preferably excluded from the present invention 

15 are one or more polynucleotides comprising a nucleotide sequence described by the 
general formula of a-b, where a is any integer between 1 to 2240 of SEQ ID NO:53, b 
is an integer of 15 to 2254, where both a and b correspond to the positions of 
nucleotide residues shown in SEQ ID NO:53, and where b is greater than or equal to a 
+ 14. 

20 

FEATURES OF PROTEIN ENCODED BY GENE NO: 44 

The translation product of this gene shares weak sequence homology with 
Eimeria antigen Eam45 M3, which is thought to be important in uses as a vaccine for 
protecting chickens against coccidiosis. 

25 It has been discovered that this gene is expressed primarily in adrenal gland 

tissue, and to a lesser extent in activated T-cells. 

Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the tissue(s) or cell type(s) present in a biological sample and for 
diagnosis of the following diseases and conditions: adrenal cortical insufficiency, 

30 adrenal cortical hyperfunction, neoplasia. Similarly, polypeptides and antibodies 
directed to those polypeptides are useful to provide immunological probes for 
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differential identification of the tissue(s) or cell type(s). For a number of disorders of 
the above tissues or cells, particularly of the endocrine system, expression of this gene 
at significantly higher or lower levels may be detected in certain tissues or cell types 
(e.g., endocrine, cancerous and wounded tissues) or bodily fluids (e.g., lymph, serum, 
plasma, urine, synovial fluid or spinal fluid) taken from an individual having such a 
disorder, relative to the standard gene expression level, i.e., the expression level in 
healthy tissue from an individual not having the disorder. 

The tissue distribution in adrenal gland tissue suggests that the protein product 
of this clone is useful for the diagnosis and/or intervention of disorders caused by 
adrenal gland abnormalities, such as adrenal cortical insufficiency, adrenal cortical 
hyperfunction, and neoplasia. More generally, the tissue distribution suggests that the 
protein product of this clone is useful for the detection, treatment, and/or prevention 
of various endocrine disorders and cancers, particularly Addison's disease, Cushing's 
Syndrome, and disorders and/or cancers of the pancrease (e.g. diabetes mellitus), 
adrenal cortex, ovaries, pituitary (e.g., hyper-, hypopituitarism), thyroid (e.g. hyper-, 
hypothyroidism), parathyroid (e.g. hyper-, hypoparathyroidism) , hypothalamus, and 
testes. Protein, as well as, antibodies directed against the protein may show utility as a 
tumor marker and/or immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:54 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence 
would be cumbersome. Accordingly, preferably excluded from the present invention 
are one or more polynucleotides comprising a nucleotide sequence described by the 
general formula of a-b, where a is any integer between 1 to 472 of SEQ ID NO:54, b 
is an integer of 15 to 486, where both a and b correspond to the positions of 
nucleotide residues shown in SEQ ID NO:54, and where b is greater than or equal to a 
+ 14. 
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The translation product of this gene shares sequence homology with neural 
thread protein, tumor necrosis factor related gene product, human alpha- 1C2 
adrenalin receptor, which is thought to be important for diagnosing the presence of 
Alzheimer's disease, neuroectodermal tumours and a malignant astrocytoma, or 
5 diagnosis of hepatocellular carcinomas and preneoplastic or pathological conditions 
of the liver, and tumor immunity. 

It has been discovered that this gene is expressed primarily in activated T-cells 
and endothelial cells. 

Therefore, nucleic acids of the invention are useful as reagents for differential 
10 identification of the tissue(s) or cell type(s) present in a biological sample and for 
diagnosis of the following diseases and conditions: Alzheimer's disease, 
neuroectodermal tumours and a malignant astrocytoma, hepatocellular carcinomas 
and tumors of various origins. Similarly, polypeptides and antibodies directed to those 
polypeptides are useful to provide immunological probes for differential identification 
15 of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells, 
particularly of the immune system and endothelial cells, expression of this gene at 
significantly higher or lower levels may be detected in certain tissues or cell types 
(e.g., immune, endothelial, cancerous and wounded tissues) or bodily fluids (e.g., 
lymph, serum, plasma, urine, synovial fluid or spinal fluid) taken from an individual 
20 having such a disorder, relative to the standard gene expression level, i.e., the 
expression level in healthy tissue from an individual not having the disorder. 

Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 
152 as residues: Arg-38 to Arg-47. 

The tissue distribution in immune and endothelial tissues, and the homology to 
25 neural thread protein, tumor necrosis factor related gene product, human alpha- 1C2 
adrenalin receptor, or Smaller hepatocellular oncoprotein (hhcm) gene product 
suggests that the protein product of this clone is useful for the diagnosis and/or 
treatment of tumors of various origins, including neuroectodermal tumours and a 
malignant astrocytoma, hepatocellular carcinomas, as well as syndromes inflicted by 
30 these cancers. Protein, as well as, antibodies directed against the protein may show 
utility as a tumor marker and/or immunotherapy targets for the above listed tissues. 
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Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:55 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
5 excluded from the scope of the present invention. To list every related sequence 

would be cumbersome. Accordingly, preferably excluded from the present invention 
are one or more polynucleotides comprising a nucleotide sequence described by the 
general formula of a-b, where a is any integer between 1 to 1256 of SEQ ID NO:55, b 
is an integer of 15 to 1270, where both a and b correspond to the positions of 
10 nucleotide residues shown in SEQ ID NO:55, and where b is greater than or equal to a 
+ 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 46 

It has been discovered that this gene is expressed primarily in tumor tissues 

15 such as hepatocellular tumor, hemangiopericytoma, chronic lymphocytic leukemia, 
and activated T-cells. 

Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the tissue(s) or cell type(s) present in a biological sample and for 
diagnosis of the following diseases and conditions: tumors of various origins. 

20 Similarly, polypeptides and antibodies directed to those polypeptides are useful to 
provide immunological probes for differential identification of the tissue(s) or cell 
type(s). For a number of disorders of the above tissues or cells, particularly of the 
hepatocellular tumor, expression of this gene at significantly higher or lower levels 
may be detected in certain tissues or cell types (e.g., liver, immune, cancerous and 

25 wounded tissues) or bodily fluids (e.g., lymph, serum, plasma, urine, synovial fluid or 
spinal fluid) taken from an individual having such a disorder, relative to the standard 
gene expression level, i.e., the expression level in healthy tissue from an individual 
not having the disorder. 

The tissue distribution in hepatocellular tumors suggests that the protein 

30 product of this clone is useful for the diagnosis and/or targeting of hepatocellular 

carcinomas, preneoplastic or pathological conditions of the liver, Alzheimer's disease, 



BNSDOCID: <WO 9947540A1 J_> 



WO 99/47540 



83 



PCT/US99/05804 



neuroectodermal tumours and malignant astrocytoma. Protein, as well as, antibodies 
directed against the protein may show utility as a tumor marker and/or 
immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO: 56 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence 
would be cumbersome. Accordingly, preferably excluded from the present invention 
are one or more polynucleotides comprising a nucleotide sequence described by the 
general formula of a-b, where a is any integer between 1 to 2045 of SEQ ID NO:56, b 
is an integer of 15 to 2059, where both a and b correspond to the positions of 
nucleotide residues shown in SEQ ID NO:56, and where b is greater than or equal to a 
+ 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 47 

It has been discovered that this gene is expressed primarily in glioblastoma, 
ulcerative colitis, and hemangiopericytoma. 

Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the tissue(s) or cell type(s) present in a biological sample and for 
diagnosis of the following diseases and conditions: glioblastoma, 
hemangiopericytoma and their inflicted disorders. Similarly, polypeptides and 
antibodies directed to those polypeptides are useful to provide immunological probes 
for differential identification of the tissue(s) or cell type(s). For a number of disorders 
of the above tissues or cells, particularly of the brain tissues, expression of this gene 
at significantly higher or lower levels may be detected in certain tissues or cell types 
(e.g., neural, cancerous and wounded tissues) or bodily fluids (e.g., lymph, serum, 
plasma, urine, synovial fluid or spinal fluid) taken from an individual having such a 
disorder, relative to the standard gene expression level, i.e., the expression level in 
healthy tissue from an individual not having the disorder. 
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Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 
154 as residues: Pro-31 to Ala-37. 

The tissue distribution suggests that the protein product of this clone would be 
useful for the diagnosis, targeting and/or treatment of tumors in the brain, such as 
5 glioblastoma and hemangiopericytoma. Additionally, the gene products can be useful 
agent for the diagnosis and treatment of ulcerative colitis. Protein, as well as, 
antibodies directed against the protein may show utility as a tumor marker and/or 
immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
10 available and accessible through sequence databases. Some of these sequences are 

related to SEQ ID NO:57 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence 
would be cumbersome. Accordingly, preferably excluded from the present invention 
15 are one or more polynucleotides comprising a nucleotide sequence described by the 
general formula of a-b, where a is any integer between 1 to 854 of SEQ ID NO: 57, b 
is an integer of 15 to 868, where both a and b correspond to the positions of 
nucleotide residues shown in SEQ ID NO:57, and where b is greater than or equal to a 

+ 14 - 

20 

FEATURES OF PROTEIN ENCODED BY GENE NO: 48 

It has been discovered that this gene is expressed primarily in bone marrow. 
Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the tissue(s) or cell type(s) present in a biological sample and for 

25 diagnosis of the following diseases and conditions: immunodeficiency, tumor 

necrosis, infection, lymphomas, auto-immunities, cancer, inflammation, anemias 
(leukemia) and other hematopoeitic disorders. Similarly, polypeptides and antibodies 
directed to those polypeptides are useful to provide immunological probes for 
differential identification of the tissue(s) or cell type(s). For a number of disorders of 

30 the above tissues or cells, particularly of the immune system, expression of this gene 
at significantly higher or lower levels may be detected in certain tissues or cell types 
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(e.g., immune, cancerous and wounded tissues) or bodily fluids (e.g., lymph, serum, 
plasma, urine, synovial fluid or spinal fluid) taken from an individual having such a 
disorder, relative to the standard gene expression level, i.e., the expression level in 
healthy tissue from an individual not having the disorder. 
5 Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 

155 as residues: Thr-47 to Val-53. 

The tissue distribution in bone marrow suggests that the protein product of this 
clone is useful for the diagnosis and/or treatment of immune disorders including: 
leukemias, lymphomas, auto-immunities, immunodeficiencies (e.g. AIDS), immuno- 
10 supressive conditions (transplantation) and hematopoeitic disorders. In addition this 
gene product may be applicable in conditions of general microbial infection, 
inflammation or cancer. Furthermore, the tissue distribution in bone marrow suggests 
that the protein product of this clone is useful for the treatment and diagnosis of 
hematopoietic related disorders such as anemia, pancytopenia, leukopenia, 
15 thrombocytopenia or leukemia. 

The uses include bone marrow cell ex vivo culture, bone marrow 
transplantation, bone marrow reconstitution, radiotherapy or chemotherapy of 
neoplasia. The gene product may also be involved in lymphopoiesis, therefore, it can 
be used in immune disorders such as infection, inflammation, allergy, 
20 immunodeficiency etc. In addition, this gene product may have commercial utility in 
the expansion of stem cells and committed progenitors of various blood lineages, and 
in the differentiation and/or proliferation of various cell types. Protein, as well as, 
antibodies directed against the protein may show utility as a tumor marker and/or 
immunotherapy targets for the above listed tissues. 
25 Many polynucleotide sequences, such as EST sequences, are publicly 

available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:58 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence 
*0 would be cumbersome. Accordingly, preferably excluded from the present invention 
are one or more polynucleotides comprising a nucleotide sequence described by the 
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general formula of a-b, where a is any integer between 1 to 972 of SEQ ID NO:58, b 
is an integer of 15 to 986, where both a and b correspond to the positions of 
nucleotide residues shown in SEQ ID NO:58, and where b is greater than or equal to a 
+ 14. 

5 

FEATURES OF PROTEIN ENCODED BY GENE NO: 49 

It has been discovered that this gene is expressed primarily in bone marrow. 
Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the tissue(s) or cell type(s) present in a biological sample and for 

10 diagnosis of the following diseases and conditions: immunodeficiency, tumor 

necrosis, infection, lymphomas, auto-immunities, cancer, inflammation, anemias 
(leukemia) and other hematopoeitic disorders. Similarly, polypeptides and antibodies 
directed to those polypeptides are useful to provide immunological probes for 
differential identification of the tissue(s) or cell type(s). For a number of disorders of 

1 5 the above tissues or cells, particularly of the immune system, expression of this gene 
at significantly higher or lower levels may be detected in certain tissues or cell types 
(e.g., immune, cancerous and wounded tissues) or bodily fluids (e.g., lymph, serum, 
plasma, urine, synovial fluid or spinal fluid) taken from an individual having such a 
disorder, relative to the standard gene expression level, i.e., the expression level in 

20 healthy tissue from an individual not having the disorder. 

Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 
156 as residues: Leu-40 to Cys-47. 

The bone marrow tissue distribution suggests that the protein product of this 
clone would be useful for the diagnosis and treatment of immune disorders including: 

25 leukemias, lymphomas, auto-immunities, immunodeficiencies (e.g. AIDS), immuno- 
supressive conditions (transplantation) and hematopoeitic disorders. In addition this 
gene product may be applicable in conditions of general microbial infection, 
inflammation or cancer. Furthermore, the tissue distribution in bone marrow suggests 
that the protein product of this clone is useful for the treatment and diagnosis of 

30 hematopoietic related disorders such as anemia, pancytopenia, leukopenia, 
thrombocytopenia or leukemia. 
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The uses include bone marrow cell ex vivo culture, bone marrow 
transplantation, bone marrow reconstitution, radiotherapy or chemotherapy of 
neoplasia. The gene product may also be involved in lymphopoiesis, therefore, it can 
be used in immune disorders such as infection, inflammation, allergy, 
immunodeficiency etc. In addition, this gene product may have commercial utility in 
the expansion of stem cells and committed progenitors of various blood lineages, and 
in the differentiation and/or proliferation of various cell types. Protein, as well as, 
antibodies directed against the protein may show utility as a tumor marker and/or 
immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:59 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence 
would be cumbersome. Accordingly, preferably excluded from the present invention 
are one or more polynucleotides comprising a nucleotide sequence described by the 
general formula of a-b, where a is any integer between 1 to 681 of SEQ ID NO:59, b 
is an integer of 15 to 695, where both a and b correspond to the positions of 
nucleotide residues shown in SEQ ID NO:59, and where b is greater than or equal to a 
+ 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 50 

In specific embodiments, polypeptides of the invention comprise the following 
amino acid sequence: IAQGTVPLTKRGVQSSGPDYPEGTLTPLPRG (SEQ ID 
NO:266 and 267). Polynucleotides encoding these polypeptides are also encompassed 
by the invention. 

It has been discovered that this gene is expressed primarily in dendritic cells. 

Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the tissue(s) or cell type(s) present in a biological sample and for 
diagnosis of the following diseases and conditions: immune disorders and related 
conditions such as leukemias, lymphomas, inflammation, hematopoeitic disfunction, 
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arthritis and asthma. Similarly, polypeptides and antibodies directed to those 
polypeptides are useful to provide immunological probes for differential identification 
of dendritic cells. For a number of disorders of the above tissues or cells, particularly 
of the immune system, expression of this gene at significantly higher or lower levels 
5 may be detected in certain tissues or cell types (e.g., dendritic cells, cancerous and 
wounded tissues) or bodily fluids (e.g., lymph, serum, plasma, urine, synovial fluid or 
spinal fluid) taken from an individual having such a disorder, relative to the standard 
gene expression level, i.e., the expression level in healthy tissue from an individual 
not having the disorder. 

10 Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 

157 as residues: Ser-25 to Phe-31, Lys-55 to Arg-61. 

The tissue distribution in dendritic cells suggests that the protein product of 
this clone is useful for the diagnosis and/or treatment of immune disorders including: 
leukemias, lymphomas, auto-immunities, immunodeficiencies (e.g. AIDS), immuno- 

15 supressive conditions (transplantation) and hematopoeitic disorders. In addition this 
gene product may be applicable in conditions of general microbial infection, 
inflammation or cancer. 

Moreover, the expression of this gene product in dendritic cells also strongly 
suggests a role for this protein in immune function and immune surveillance. Protein, 

20 as well as, antibodies directed against the protein may show utility as a tumor marker 
and/or immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:60 and may have been publicly available prior to conception of 

25 the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence 
would be cumbersome. Accordingly, preferably excluded from the present invention 
are one or more polynucleotides comprising a nucleotide sequence described by the 
general formula of a-b, where a is any integer between 1 to 300 of SEQ ID NO:60, b 

30 is an integer of 15 to 314, where both a and b correspond to the positions of 
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nucleotide residues shown in SEQ ID NO:60, and where b is greater than or equal to a 
+ 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 51 

In specific embodiments, polypeptides of the invention comprise the following 
amino acid sequence: DCLYLALSFPWHCHCHHHPPSGSLLYPF (SEQ ID 
NO:268). Polynucleotides encoding these polypeptides are also encompassed by the 
invention. The translation product of this gene shares sequence homology with a C. 
elegans protein of unknown function (See Genbank Accession No.: gil 1947 142 
(AF000264)). 

It has been discovered that this gene is expressed primarily in healing 
abdominal wound tissue. 

Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the tissue(s) or cell type(s) present in a biological sample and for 
diagnosis of the following diseases and conditions: tissue necrosis, wound healing, 
ulceration, neoplasms or cancer. Similarly, polypeptides and antibodies directed to 
those polypeptides are useful to provide immunological probes for differential 
identification of the tissue(s) or cell type(s). For a number of disorders of the above 
tissues or cells, particularly of injured tissue, expression of this gene at significantly 
higher or lower levels may be detected in certain tissues or cell types (e.g., vascular, 
endothelial, cancerous and wounded tissues) or bodily fluids (e.g., lymph, serum, 
plasma, urine, synovial fluid or spinal fluid) taken from an individual having such a 
disorder, relative to the standard gene expression level, i.e., the expression level in 
healthy tissue from an individual not having the disorder. 

Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 
158 as residues: Pro-34 to Tyr-43, Gln-73 to Cys-86, Pro-98 to Leu- 103. 

The tissue distribution in healing abdominal wound tissue suggests that the 
protein product of this clone is useful for the treatment and/or diagnosis of conditions 
involving tissue repair and wound healing. Tissue repair may be indicated in cases of 
injury to the skin or internal organs, ulceration, cellular necrosis or other conditions 
involving healing of both diseased or non-diseased, traumatized tissue. In addition, 
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because of the implications of tissue regeneration, remoldeling and growth regulation, 
the protein product of this gene may have indications in the diagnosis and treatment 
of neoplasms and cancer. 

More generally, the tissue distribution in endothelial tissue indicates that the 
5 protein product of this gene is useful for the diagnosis and treatment of conditions and 
pathologies of the cardiovascular system, such as heart disease, restenosis, 
atherosclerosis, stoke, angina, thrombosis, and wound healing. Likewise, the tissue 
distribution further suggests that the protein product of this clone is useful for the 
treatment, diagnosis, and/or prevention of various skin disorders including congenital 

10 disorders (i.e. nevi, moles, freckles, Mongolian spots, hemangiomas, port-wine 
syndrome), integumentary tumors (i.e. keratoses, Bowen's disease, basal cell 
carcinoma, squamous cell carcinoma, malignant melanoma, Paget' s disease, mycosis 
fungoides, and Kaposi's sarcoma), injuries and inflammation of the skin (i.e. wounds, 
rashes, prickly heat disorder, psoriasis, dermatitis), atherosclerosis, uticaria, eczema, 

15 photosensitivity, autoimmune disorders (i.e. lupus erythematosus, vitiligo, 

dermatomyositis, morphea, scleroderma, pemphigoid, and pemphigus), keloids, striae, 
erythema, petechiae, purpura, and xanthelasma. In addition, such disorders may 
predispose increased susceptibility to viral and bacterial infections of the skin (i.e. 
cold sores, warts, chickenpox, molluscum contagiosum, herpes zoster, boils, cellulitis, 

20 erysipelas, impetigo, tinea, althletes foot, and ringworm). Moreover, the protein 
product of this clone may also be useful for the treatment or diagnosis of various 
connective tissue disorders such as arthritis, trauma, tendonitis, chrondomalacia and 
inflammation, autoimmune disorders such as rheumatoid arthritis, lupus, scleroderma, 
and dermatomyositis as well as dwarfism, spinal deformation, and specific joint 

25 abnormalities as well as chondrodysplasias (i.e. spondyloepiphyseal dysplasia 
congenita, familial osteoarthritis, Atelosteogenesis type II, metaphyseal 
chondrodysplasia type Schmid). Protein, as well as, antibodies directed against the 
protein may show utility as a tumor marker and/or immunotherapy targets for the 
above listed tissues. 

30 Many polynucleotide sequences, such as EST sequences, are publicly 

available and accessible through sequence databases. Some of these sequences are 
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related to SEQ ID NO:61 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence 
would be cumbersome. Accordingly, preferably excluded from the present invention 
5 are one or more polynucleotides comprising a nucleotide sequence described by the 
general formula of a-b, where a is any integer between 1 to 720 of SEQ ID NO:61, b 
is an integer of 15 to 734, where both a and b correspond to the positions of 
nucleotide residues shown in SEQ ID NO: 61 , and where b is greater than or equal to a 
+ 14. 

10 

FEATURES OF PROTEIN ENCODED BY GENE NO: 52 

The translation product of this gene shares sequence homology with FAR- 
17 A, which is an androgen induced protein, absent in castrated hamsters (See 
Genbank Accession No.: giil91315), as well as a male hormone-dependent gene 
15 product (See GenSeq Accession No.: R 106 12). The gene encoding the disclosed 

cDNA is thought to reside on chromosome 6. Accordingly, polynucleotides related to 
this invention are useful as a marker in linkage analysis for chromosome 6. 

In specific embodiments, polypeptides of the invention comprise the following 
amino acid sequences: ASLPPSRSRPLANMALVPCQVLRMAILLSYCSILCNYKA 

20 IEMPSHQTYGGSWKFLTFIDLVIQAVFFGICVLTDLSSLLTRGSGNQEQERQLK 
KLISLRDW (SEQ ID NO:269). Polynucleotides encoding these polypeptides are also 
encompassed by the invention. 

It has been discovered that this gene is expressed primarily in fetal liver and 
spleen tissue, and to a lesser extent in a variety of other fetal tissues and brain tissues. 

25 Therefore, nucleic acids of the invention are useful as reagents for differential 

identification of the tissue(s) or cell type(s) present in a biological sample and for 
diagnosis of the following diseases and conditions: immune disorders including 
leukemias, lymphomas; reproductive and endocrine disorders, including testicular 
cancer; and liver disorders (e.g. hepatoblastoma, metabolic diseases and conditions 

30 that are attributable to the differentiation of hepatocyte progenitor cells). Similarly, 
polypeptides and antibodies directed to those polypeptides are useful to provide 
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immunological probes for differential identification of the tissue(s) or cell type(s). For 
.a number of disorders of the above tissues or cells, particularly of the immune and 
reproductive systems, expression of this gene at significantly higher or lower levels 
may be detected in certain tissues or cell types (e.g., immune, reproductive, cancerous 
5 and wounded tissues) or bodily fluids (e.g., lymph, serum, plasma, urine, synovial 
fluid or spinal fluid) taken from an individual having such a disorder, relative to the 
standard gene expression level, i.e., the expression level in healthy tissue from an 
individual not having the disorder. 

Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 

10 159 as residues; Thr-59 to Gly-70, Tyr-132 to Glu-150. 

The tissue distribution and homology to FAR- 17 A suggests that the protein 
product of this clone is useful for the treatment and/or diagnosis of androgen related 
conditions and disorders. Male reproductive and endocrine disorders would be 
potential area of application (e.g. endocrine function, sperm maturation). It may also 

15 prove to be valuable in the diagnosis and treatment of testicular cancer. 

More generally, the protein product of this clone may be useful for the 
treatment and/or diagnosis of conditions concerning proper testicular function (e.g. 
endocrine function, sperm maturation), as well as cancer. Therefore, this gene product 
is useful in the treatment of male infertility and/or impotence. This gene product is 

20 also useful in assays designed to identify binding agents, as such agents (antagonists) 
are useful as male contraceptive agents. Similarly, the protein is believed to be useful 
in the treatment and/or diagnosis of testicular cancer. The testes are also a site of 
active gene expression of transcripts that may be expressed, particularly at low levels, 
in other tissues of the body. Therefore, this gene product may be expressed in other 

25 specific tissues or organs where it may play related functional roles in other 

processes, such as hematopoiesis, inflammation, bone formation, and kidney function, 
to name a few possible target indications. Protein, as well as, antibodies directed 
against the protein may show utility as a tumor marker and/or immunotherapy targets 
for the above listed tissues. 

30 Many polynucleotide sequences, such as EST sequences, are publicly 

available and accessible through sequence databases. Some of these sequences are 
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related to SEQ ID NO: 62 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence 
would be cumbersome. Accordingly, preferably excluded from the present invention 
5 are one or more polynucleotides comprising a nucleotide sequence described by the 
general formula of a-b, where a is any integer between 1 to 1396 of SEQ ID NO:62, b 
is an integer of 15 to 1410, where both a and b correspond to the positions of 
nucleotide residues shown in SEQ ID NO:62, and where b is greater than or equal to a 
+ 14. 

10 

FEATURES OF PROTEIN ENCODED BY GENE NO: 53 

Contact of cells with supernatant expressing the product of this gene has been 
shown to increase the permeability of the plasma membrane of THP-1 to calcium. 
Thus it is likely that the product of this gene is involved in a signal transduction 

15 pathway that is initiated when the product binds a receptor on the surface of the 

plasma membrane of monocytes, and to a lesser extent, in immune or hematopoietic 
cells and tissues. Thus, polynucleotides and polypeptides have uses which include, 
but are not limited to, activating monocytes. 

In specific embodiments, polypeptides of the invention comprise the following 

20 amino acid sequence: MSRSSRISGLSCPWLL (SEQ ID NO:270). Polynucleotides 
encoding these polypeptides are also encompassed by the invention. The gene 
encoding the disclosed cDNA is believed to reside on chromosome 1 . Accordingly, 
polynucleotides related to this invention are useful as a marker in linkage analysis for 
chromosome 1. 

25 It has been discovered that this gene is expressed primarily in T-cells. 

Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the tissue(s) or cell type(s) present in a biological sample and for 
diagnosis of the following diseases and conditions: immune and hematopoietic 
diseases and/or disorders. Similarly, polypeptides and antibodies directed to those 

30 polypeptides are useful to provide immunological probes for differential identification 
of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells, 
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particularly of the immune and haemopoietic systems, expression of this gene at 
significantly higher or lower levels may be detected in certain tissues or cell types 
(e.g., immune, hematopoietic, and cancerous and wounded tissues) or bodily fluids 
(e.g., lymph, serum, plasma, urine, synovial fluid or spinal fluid) taken from an 
5 individual having such a disorder, relative to the standard gene expression level, i.e., 
the expression level in healthy tissue from an individual not having the disorder. 

Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 
160 as residues: Pro-42 to Cys-50, Leu-61 to Ala-66. 

The tissue distribution in T-cells, combined with the detected calcium flux 

10 activity in monocytes suggests that the protein product of this clone would be useful 
for the treatment and diagnosis of immune and hematopoietic disorders. Morever, the 
expression of this gene product suggests a role in regulating the proliferation; 
survival; differentiation; and/or activation of hematopoietic cell lineages, including 
blood stem cells. This gene product may be involved in the regulation of cytokine 

15 production, antigen presentation, or other processes suggesting a usefulness in the 
treatment of cancer (e.g. by boosting immune responses). 

Since the gene is expressed in cells of lymphoid origin, the natural gene 
product may be involved in immune functions. Therefore it may be also used as an 
agent for immunological disorders including arthritis, asthma, immunodeficiency 

20 diseases such as AIDS, leukemia, rheumatoid arthritis, granulomatous disease, 
inflammatory bowel disease, sepsis, acne, neutropenia, neutrophilia, psoriasis, 
hypersensitivities, such as T-cell mediated cytotoxicity; immune reactions to 
transplanted organs and tissues, such as host-versus-graft and graft- versus-host 
diseases, or autoimmunity disorders, such as autoimmune infertility, lense tissue 

25 injury, demyelination, systemic lupus erythematosis, drug induced hemolytic anemia, 
rheumatoid arthritis, Sjogren's disease, scleroderma and tissues. 

Moreover, the protein may represent a secreted factor that influences the 
differentiation or behavior of other blood cells, or that recruits hematopoietic cells to 
sites of injury. In addition, this gene product may have commercial utility in the 

30 expansion of stem cells and committed progenitors of various blood lineages, and in 
the differentiation and/or proliferation of various cell types. Protein, as well as, 
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antibodies directed against the protein may show utility as a tumor marker and/or 
immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:63 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence 
would be cumbersome. Accordingly, preferably excluded from the present invention 
are one or more polynucleotides comprising a nucleotide sequence described by the 
general formula of a-b, where a is any integer between 1 to 12 17 of SEQ ID NO: 63, b 
is an integer of 15 to 1231, where both a and b correspond to the positions of 
nucleotide residues shown in SEQ ID NO:63, and where b is greater than or equal to a 
+ 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 54 

In specific embodiments, polypeptides of the invention comprise the following 
amino acid sequence: DHWPAGFLPPAPGLKFPVALEVFRKVLPAVCPTDCSGS 
AGKERNS (SEQ ID NO:271). Polynucleotides encoding these polypeptides are also 
encompassed by the invention. 

It has been discovered that this gene is expressed primarily in liver. 
Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the tissue(s) or cell type(s) present in a biological sample and for 
diagnosis of the following diseases and conditions: metabolic diseases and liver 
conditions. Similarly, polypeptides and antibodies directed to those polypeptides are 
useful to provide immunological probes for differential identification of the tissue(s) 
or cell type(s). For a number of disorders of the above tissues or cells, particularly of 
the metabolic system, expression of this gene at significantly higher or lower levels 
may be detected in certain tissues or cell types (e.g., hepatic, liver, metabolic, and 
cancerous and wounded tissues) or bodily fluids (e.g., lymph, bile, serum, plasma, 
urine, synovial fluid or spinal fluid) taken from an individual having such a disorder, 
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relative to the standard gene expression level, i.e., the expression level in healthy 
tissue from an individual not having the disorder. 

Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 
161 as residues: Ser-31 to Gln-41. 
5 The tissue distribution in liver suggests that the protein product of this clone 

would be useful for treatment and diagnosis of disorders of the metabolic system and 
liver disorders. Morever, the protein product of this clone is useful for the detection 
and treatment of liver disorders and cancers (e.g. hepatoblastoma, jaundice, hepatitis, 
liver metabolic diseases and conditions that are attributable to the differentiation of 

10 hepatocyte progenitor cells). Protein, as well as, antibodies directed against the 
protein may show utility as a tumor marker and/or immunotherapy targets for the 
above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 

15 related to SEQ ID NO:64 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence 
would be cumbersome. Accordingly, preferably excluded from the present invention 
are one or more polynucleotides comprising a nucleotide sequence described by the 

20 general formula of a-b, where a is any integer between 1 to 598 of SEQ ID NO:64, b 
is an integer of 15 to 612, where both a and b correspond to the positions of 
nucleotide residues shown in SEQ ID NO:64, and where b is greater than or equal to a 
+ 14. 

25 FEATURES OF PROTEIN ENCODED BY GENE NO: 55 

When tested against PC 12 cell lines, supernatants removed from cells 
containing this gene activated the EGR1 (early growth response gene 1) promoter 
element. Thus, it is likely that this gene activates sensory neuron cells, and to a lesser 
extent in other neural cells and tissues, through the EGR1 signal transduction 

30 pathway. EGR1 is a separate signal transduction pathway from Jak-STAT, genes 
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containing the EGR1 promoter are induced in various tissues and cell types upon 
activation, leading the cells to undergo differentiation and proliferation. 

It has been discovered that this gene is expressed primarily in T-cells and 
monocytes, and to a lesser extent in cancerous tissues, including cancerous colon 

5 tissue and placenta. 

Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the tissue(s) or cell type(s) present in a biological sample and for 
diagnosis of the following diseases and conditions: immune and haemopoietic 
disorders and cancer such as colon cancer, but also such cancers as breast cancer, 

10 cardiac tumors, pancreatic cancer, melanoma, retinoblastoma, glioblastoma, lung 

cancer, intestinal cancer, testicular cancer, stomach cancer, neuroblastoma, myxoma, 
myoma, lymphoma, endothelioma, osteoblastoma, osteoclastoma, adenoma, and the 
like. Similarly, polypeptides and antibodies directed to those polypeptides are useful 
to provide immunological probes for differential identification of the tissue(s) or cell 

15 type(s). For a number of disorders of the above tissues or cells, particularly of the 

immune and haemopoietic systems, expression of this gene at significantly higher or 
lower levels may be detected in certain tissues or cell types (e.g., immune, 
hematopoietic, and cancerous and wounded tissues) or bodily fluids (e.g., lymph, 
serum, plasma, urine, synovial fluid or spinal fluid) taken from an individual having 

20 such a disorder, relative to the standard gene expression level, i.e., the expression 
level in healthy tissue from an individual not having the disorder. 

Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 
162 as residues: Glu-63 to Trp-72. 

The tissue distribution in T-cells and monocytes, combined with the detected 

25 EGR1 biological activity suggests that the protein product of this clone would be 
useful for treatment and diagnosis of disorders of the immune and haemopoietic 
systems and colon and other cancers. This gene product may be involved in the 
regulation of cytokine production, antigen presentation, or other processes suggesting 
a usefulness in the treatment of cancer (e.g. by boosting immune responses). 

30 Since the gene is expressed in cells of lymphoid origin, the natural gene 

product may be involved in immune functions. Therefore it may be also used as an 
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agent for immunological disorders including arthritis, asthma, immunodeficiency 
diseases such as AIDS, leukemia, rheumatoid arthritis, granulomatous disease, 
inflammatory bowel disease, sepsis, acne, neutropenia, neutrophilia, psoriasis, 
hypersensitivities, such as T-cell mediated cytotoxicity; immune reactions to 
5 transplanted organs and tissues, such as host-versus-graft and graft- versus-host 
diseases, or autoimmunity disorders, such as autoimmune infertility, lense tissue 
injury, demyelination, systemic lupus erythematosis, drug induced hemolytic anemia, 
rheumatoid arthritis, Sjogren's disease, scleroderma and tissues. 

Moreover, the protein may represent a secreted factor that influences the 

10 differentiation or behavior of other blood cells, or that recruits hematopoietic cells to 
sites of injury. In addition, this gene product may have commercial utility in the 
expansion of stem cells and committed progenitors of various blood lineages, and in 
the differentiation and/or proliferation of various cell types. Expression cellular 
sources marked by proliferating cells suggests this protein may play a role in the 

15 regulation of cellular division, and may show utility in the diagnosis and treatment of 
cancer and other proliferative disorders. Similarly, developmental tissues rely on 
decisions involving cell differentiation and/or apoptosis in pattern formation. 
Dysregulation of apoptosis can result in inappropriate suppression of cell death, as 
occurs in the development of some cancers, or in failure to control the extent of cell 

20 death, as is believed to occur in acquired immunodeficiency and certain 
neurodegenerative disorders, such as spinal muscular atrophy (SMA). 

Therefore, the polynucleotides and polypeptides of the present invention are 
useful in treating, detecting, and/or preventing said disorders and conditions, in 
addition to other types of degenerative conditions. Thus this protein may modulate 

25 apoptosis or tissue differentiation and would be useful in the detection, treatment, 

and/or prevention of degenerative or proliferative conditions and diseases. Protein, as 
well as, antibodies directed against the protein may show utility as a tumor marker 
and/or immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 

30 available and accessible through sequence databases. Some of these sequences are 

related to SEQ ID NO: 65 and may have been publicly available prior to conception of 
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the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence 
would be cumbersome. Accordingly, preferably excluded from the present invention 
are one or more polynucleotides comprising a nucleotide sequence described by the 
5 general formula of a-b, where a is any integer between 1 to 2256 of SEQ ID NO:65, b 
is an integer of 15 to 2270, where both a and b correspond to the positions of 
nucleotide residues shown in SEQ ID NO: 65, and where b is greater than or equal to a 
+ 14. 

10 FEATURES OF PROTEIN ENCODED BY GENE NO: 56 

The translation product of this gene has homology with several human keratin 
genes at the nucleotide level (see, for example, Troyanovsky, et al„ Eur. J. Cell Biol. 
59:127-137 (1992) which is hereby incorporated by reference herein). Based on the 
sequence similarity, the translation product of this clone is expected to share 
15 biological activities with keratin and growth factor proteins. Such activities are known 
in the art, and some of which are described elsewhere herein. 

It has been discovered that this gene is expressed primarily in neutrophils. 
Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the tissue(s) or cell type(s) present in a biological sample and for 

20 diagnosis of the following diseases and conditions: immune and haemopoietic 

disorders. Similarly, polypeptides and antibodies directed to those polypeptides are 
useful to provide immunological probes for differential identification of the tissue(s) 
or cell type(s). For a number of disorders of the above tissues or cells, particularly of 
the immune and haemopoietic system, expression of this gene at significantly higher 

25 or lower levels may be detected in certain tissues or cell types (e.g., cancerous and 

wounded tissues) or bodily fluids (e.g., lymph, serum, plasma, urine, synovial fluid or 
spinal fluid) taken from an individual having such a disorder, relative to the standard 
gene expression level, i.e., the expression level in healthy tissue from an individual 
not having the disorder. 

30 The tissue distribution in neutrophils suggests that the protein product of this 

clone would be useful for treatment and diagnosis of disorders of the immune and 
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haemopoietic system. Furthermore, sequence homology of the polynucleotides and 
polypeptides of the present invention with a number of human cytokeratin molecules, 
such as CK-8, CK-15, and CK-17, indicate that molecules of the present invention 
can be used diagnostically as markers of basal cell differentiation in complex epithelia 
5 and therefore indicative of a certain type of epithelial stem cells, as well as markers of 
the differentiation of other cell types such as neutrophils or other immune cells. 
Molecules of the present invention, or agonists or antagonists thereof, can also be 
used therapeutically to treat differentiation disorders of epithelial, neutrophil or other 
immune cell differentiation or activation. Protein, as well as, antibodies directed 

10 against the protein may show utility as a tumor marker and/or immunotherapy targets 
for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:66 and may have been publicly available prior to conception of 

1 5 the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence 
would be cumbersome. Accordingly, preferably excluded from the present invention 
are one or more polynucleotides comprising a nucleotide sequence described by the 
general formula of a-b, where a is any integer between 1 to 1269 of SEQ ID NO:66, b 

20 is an integer of 15 to 1283, where both a and b correspond to the positions of 

nucleotide residues shown in SEQ ID NO:66, and where b is greater than or equal to a 
+ 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 57 

25 In specific embodiments, polypeptides of the invention comprise the following 

amino acid sequence: EEIATSIEPIRDFLAIVFFASIGLHVFPTFVAYELTVLVF 
LTLSVVV (SEQ ID NO:272). Polynucleotides encoding these polypeptides are also 
encompassed by the invention. 

It has been discovered that this gene is expressed primarily in synovium, 

30 placenta, and stromal cells, and to a lesser extent in several other tissues and organs, 
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including, among others, bone marrow, palate, pituitary gland, and in tissue derived 
from osteosarcoma and chondrosarcoma. 

Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the tissue(s) or cell type(s) present in a biological sample and for 
diagnosis of the following diseases and conditions: developmental disorders, as well 
as disorders of the musculoskeletal and haematopoietic systems, and cancers 
including especially osteosarcoma and chondrosarcoma, but also other cancers 
including breast cancer, colon cancer, cardiac tumors, pancreatic cancer, melanoma, 
retinoblastoma, glioblastoma, lung cancer, intestinal cancer, testicular cancer, 
stomach cancer, neuroblastoma, myxoma, myoma, lymphoma, endothelioma, 
osteoblastoma, osteoclastoma, adenoma, and the like. Similarly, polypeptides and 
antibodies directed to those polypeptides are useful to provide immunological probes 
for differential identification of the tissue(s) or cell type(s). For a number of disorders 
of the above tissues or cells, particularly of the haemopoietic and musculoskeletal 
systems, as well as developmental disorders, expression of this gene at significantly 
higher or lower levels may be detected in certain tissues or cell types (e.g., synovium, 
placenta, stromal, immune, hematopoietic, skeletal, and cancerous and wounded 
tissues) or bodily fluids (e.g., lymph, serum, plasma, urine, synovial fluid or spinal 
fluid) taken from an individual having such a disorder, relative to the standard gene 
expression level, i.e., the expression level in healthy tissue from an individual not 
having the disorder. 

Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 
164 as residues: Pro-81 to Ser-88. 

The tissue distribution in placenta suggests that the protein product of this 
clone would be useful for treatment and diagnosis of developmental disorders. 
Polynucleotides and polypeptides of the present invention can be used diagnostically 
and therapeutically to detect and treat many cancers, particularly osteosarcoma and 
chondrosarcoma. In addition, the expression of this gene product in synovium would 
suggest a role in the detection and treatment of disorders and conditions affecting the 
skeletal system, in particular osteoporosis, bone cancer, as well as, disorders afflicting 
connective tissues (e.g. arthritis, trauma, tendonitis, chrondomalacia and 
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inflammation), such as in the diagnosis or treatment of various autoimmune disorders 
such as rheumatoid arthritis, lupus, scleroderma, and dermatomyositis as well as 
dwarfism, spinal deformation, and specific joint abnormalities as well as 
chondrodysplasias (i.e. spondyloepiphyseal dysplasia congenita, familial 
5 osteoarthritis, Atelosteogenesis type II, metaphyseal chondrodysplasia type Schmid). 

Moreover, the protein is useful in the detection, treatment, and/or prevention 
of a variety of vascular disorders and condtions, which include, but are not limited to 
miscrovascular disease, vascular leak syndrome, aneurysm, stroke, embolism, 
thrombosis, coronary artery disease, arteriosclerosis, and/or atherosclerosis. Protein, 

10 as well as, antibodies directed against the protein may show utility as a tumor marker 
and/or immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:67 and may have been publicly available prior to conception of 

15 the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence 
would be cumbersome. Accordingly, preferably excluded from the present invention 
are one or more polynucleotides comprising a nucleotide sequence described by the 
general formula of a-b, where a is any integer between 1 to 1249 of SEQ ID NO:67, b 

20 is an integer of 15 to 1263, where both a and b correspond to the positions of 

nucleotide residues shown in SEQ ID NO:67, and where b is greater than or equal to a 
+ 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 58 

25 Contact of cells with supernatant expressing the product of this gene has been 

shown to increase the permeability of the plasma membrane of renal messiaglia cells 
to calcium. Thus it is likely that the product of this gene is involved in a signal 
transduction pathway that is initiated when the product binds a receptor on the surface 
of the plasma membrane of renal and developing cells and tissues.Thus, 

30 polynucleotides and polypeptides have uses which include, but are not limited to, 
activating renal and developing cells and tissues. 
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In specific embodiments, polypeptides of the invention comprise the following 
amino acid sequence: YCNLQCR (SEQ ID NO:273). Polynucleotides encoding these 
polypeptides are also encompassed by the invention. 

It has been discovered that this gene is expressed primarily in the whole 
5 developing embryo, as well as in ovarian cancer and placenta. 

Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the tissue(s) or cell type(s) present in a biological sample and for 
diagnosis of the following diseases and conditions: developmental or reproductive 
diseases and/or disorders, in addition to the following and ovarian cancer, as well as 

10 other cancers including breast cancer, colon cancer, cardiac tumors, pancreatic cancer, 
melanoma, retinoblastoma, glioblastoma, lung cancer, intestinal cancer, testicular 
cancer, stomach cancer, neuroblastoma, myxoma, myoma, lymphoma, endothelioma, 
osteoblastoma, osteoclastoma, osteosarcoma, chondrosarcoma, adenoma, and the like. 
Similarly, polypeptides and antibodies directed to those polypeptides are useful to 

15 provide immunological probes for differential identification of the tissue(s) or cell 
type(s). For a number of disorders of the above tissues or cells, particularly of the 
developing and fetal system, expression of this gene at significantly higher or lower 
levels may be detected in certain tissues or cell types (e.g., developmental, 
reproductive, and cancerous and wounded tissues) or bodily fluids (e.g., lymph, 

20 amniotic fluid, serum, plasma, urine, synovial fluid or spinal fluid) taken from an 

individual having such a disorder, relative to the standard gene expression level, i.e., 
the expression level in healthy tissue from an individual not having the disorder. 

The tissue distribution in embryonic and ovarian tissue, combined with the 
detected calcium flux activity, suggests that the protein product of this clone would be 

25 useful for tretment and diagnosis of developmental disorders as well as ovarian and 
other cancers. Expression within embryonic tissue and other cellular sources marked 
by proliferating cells suggests this protein may play a role in the regulation of cellular 
division, and may show utility in the diagnosis and treatment of cancer and other 
proliferative disorders. Similarly, developmental tissues rely on decisions involving 

30 cell differentiation and/or apoptosis in pattern formation. Dysregulation of apoptosis 
can result in inappropriate suppression of cell death, as occurs in the development of 
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some cancers, or in failure to control the extent of cell death, as is believed to occur in 
acquired immunodeficiency and certain neurodegenerative disorders, such as spinal 
muscular atrophy (SMA). 

Therefore, the polynucleotides and polypeptides of the present invention are 
5 useful in treating, detecting, and/or preventing said disorders and conditions, in 
addition to other types of degenerative conditions. Thus this protein may modulate 
apoptosis or tissue differentiation and would be useful in the detection, treatment, 
and/or prevention of degenerative or proliferative conditions and diseases. 
Alternatively, the protein is useful in the detection, treatment, and/or prevention of 

10 vascular conditions, which include, but are not limited to, microvascular disease, 
vascular leak syndrome, aneurysm, stroke, atherosclerosis, arteriosclerosis, or 
embolism. Protein, as well as, antibodies directed against the protein may show utility 
as a tumor marker and/or immunotherapy targets for the above listed tissues. 
Many polynucleotide sequences, such as EST sequences, are publicly 

15 available and accessible through sequence databases. Some of these sequences are 

related to SEQ ID NO: 68 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence 
would be cumbersome. Accordingly, preferably excluded from the present invention 

20 are one or more polynucleotides comprising a nucleotide sequence described by the 
general formula of a-b, where a is any integer between 1 to 1603 of SEQ ID NO:68, b 
is an integer of 15 to 1617, where both a and b correspond to the positions of 
nucleotide residues shown in SEQ ID NO:68, and where b is greater than or equal to a 
+ 14. 

25 

FEATURES OF PROTEIN ENCODED BY GENE NO: 59 

In specific embodiments, polypeptides of the invention comprise the following 
amino acid sequence: SALIGNPKGCFGCFSPVVLREWSVESWKSLRPFQAICK 
LKTNFR (SEQ ID NO: 274). Polynucleotides encoding these polypeptides are also 
30 encompassed by the invention. 
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It has been discovered that this gene is expressed primarily in hypothalamus 
and anergic T cells. 

Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the tissue(s) or cell type(s) present in a biological sample and for 
5 diagnosis of the following diseases and conditions: neurological and inflammatory 
defects, diseases, and/or disorders. Similarly, polypeptides and antibodies directed to 
those polypeptides are useful to provide immunological probes for differential 
identification of the tissue(s) or cell type(s). For a number of disorders of the above 
tissues or cells, particularly of the central nervous and immune systems, expression of 

10 this gene at significantly higher or lower levels may be detected in certain tissues 

(e.g., neural, immune, hematopoietic, and cancerous and wounded tissues) or bodily 
fluids (e.g., lymph, serum, plasma, urine, synovial fluid or spinal fluid) taken from an 
individual having such a disorder, relative to the standard gene expression level, i.e., 
the expression level in healthy tissue from an individual not having the disorder. 

15 Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 

166 as residues: His-33 to Trp-38. 

The tissue distribution in hypothalamus and T-cells suggests that the protein 
product of this clone would be useful for study and treatment of immune and nervous 
system disorders. The protein product of this clone is useful for the detection, 

20 treatment, and/or prevention of neurodegenerative disease states, behavioral 
disorders, or inflammatory conditions which include, but are not limited to 
Alzheimer's Disease, Parkinson's Disease, Huntington's Disease, Tourette Syndrome, 
meningitis, encephalitis, demyelinating diseases, peripheral neuropathies, neoplasia, 
trauma, congenital malformations, spinal cord injuries, ischemia and infarction, 

25 aneurysms, hemorrhages, schizophrenia, mania, dementia, paranoia, obsessive 
compulsive disorder, depression, panic disorder, learning disabilities, ALS, 
psychoses, autism, and altered behaviors, including disorders in feeding, sleep 
patterns, balance, and perception. In addition, elevated 

Expression of this gene product in regions of the brain suggests it plays a role 

30 in normal neural function. Potentially, this gene product is involved in synapse 
formation, neurotransmission, learning, cognition, homeostasis, or neuronal 
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differentiation or survival. Morever, the expression of this gene product suggests a 
role in regulating the proliferation; survival; differentiation; and/or activation of 
hematopoietic cell lineages, including blood stem cells. This gene product may be 
involved in the regulation of cytokine production, antigen presentation, or other 
5 processes suggesting a usefulness in the treatment of cancer (e.g. by boosting immune 
responses). 

Since the gene is expressed in cells of lymphoid origin, the natural gene 
product may be involved in immune functions. Therefore it may be also used as an 
agent for immunological disorders including arthritis, asthma, immunodeficiency 

10 diseases such as AIDS, leukemia, rheumatoid arthritis, granulomatous disease, 
inflammatory bowel disease, sepsis, acne, neutropenia, neutrophilia, psoriasis, 
hypersensitivities, such as T-cell mediated cytotoxicity; immune reactions to 
transplanted organs and tissues, such as host-versus-graft and graft-versus-host 
diseases, or autoimmunity disorders, such as autoimmune infertility, lense tissue 

15 injury, demyelination, systemic lupus erythematosis, drug induced hemolytic anemia, 
rheumatoid arthritis, Sjogren's disease, scleroderma and tissues. Moreover, the protein 
may represent a secreted factor that influences the differentiation or behavior of other 
blood cells, or that recruits hematopoietic cells to sites of injury. In addition, this gene 
product may have commercial utility in the expansion of stem cells and committed 

20 progenitors of various blood lineages, and in the differentiation and/or proliferation of 
various cell types. Protein, as well as, antibodies directed against the protein may 
show utility as a tumor marker and/or immunotherapy targets for the above listed 
tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
25 available and accessible through sequence databases. Some of these sequences are 

related to SEQ ID NO:69 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence 
would be cumbersome. Accordingly, preferably excluded from the present invention 
30 are one or more polynucleotides comprising a nucleotide sequence described by the 
general formula of a-b, where a is any integer between 1 to 1375 of SEQ ID NO:69, b 
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is an integer of 15 to 1389, where both a and b correspond to the positions of 
nucleotide residues shown in SEQ ID NO:69, and where b is greater than or equal to a 
+ 14. 

5 FEATURES OF PROTEIN ENCODED BY GENE NO: 60 

The translation product of this gene shares nucleotide sequence homology 
with the human PKD1 gene which is thought to be important in polycystic kidney 
disease. 

This gene is expressed widely with a predominant expression exhibited in 

10 liver, pediatric kidney, and in the whole 8 week old developing human embryo. 

Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the tissue(s) or cell type(s) present in a biological sample and for 
diagnosis of the following diseases and conditions: cancer, growth, renal, and 
metabolic defects, diseases, and/or disorders. Similarly, polypeptides and antibodies 

15 directed to those polypeptides are useful to provide immunological probes for 

differential identification of the tissue(s) or cell type(s). For a number of disorders of 
the above tissues or cells, particularly of the endocrine, digestive and immune 
systems, expression of this gene at significantly higher or lower levels may be 
detected in certain tissues or cell types (e.g., renal, metabolic, hepatic, developmental, 

20 and cancerous and wounded tissues) or bodily fluids (e.g., lymph, amniotic fluid, bile, 
serum, plasma, urine, synovial fluid or spinal fluid) taken from an individual having 
such a disorder, relative to the standard gene expression level, i.e., the expression 
level in healthy tissue from an individual not having the disorder. 

The tissue distribution in pediatric kidney suggests that the protein product of 

25 this clone would be useful for study and treatment of renal and general neoplasias and 
growth and development disorders. The protein product of this clone could be used in 
the treatment and/or detection of kidney diseases including renal failure, nephritus, 
renal tubular acidosis, proteinuria, pyuria, edema, pyelonephritis, hydronephritis, 
nephrotic syndrome, crush syndrome, glomerulonephritis, hematuria, renal colic and 

30 kidney stones, in addition to Wilm's Tumor Disease, and congenital kidney 

abnormalities such as horseshoe kidney, polycystic kidney, and Falconi's syndrome. 
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Moreover, the expression within embryonic tissue suggests this protein may 
play a role in the regulation of cellular division, and may show utility in the diagnosis 
and treatment of cancer and other proliferative disorders, particularly of the liver and 
other organs. Similarly, developmental tissues rely on decisions involving cell 
5 differentiation and/or apoptosis in pattern formation. Dysregulation of apoptosis can 
result in inappropriate suppression of cell death, as occurs in the development of some 
cancers, or in failure to control the extent of cell death, as is believed to occur in 
acquired immunodeficiency and certain neurodegenerative disorders, such as spinal 
muscular atrophy (SMA). Therefore, the polynucleotides and polypeptides of the 

10 present invention are useful in treating, detecting, and/or preventing said disorders 

and conditions, in addition to other types of degenerative conditions. Thus this protein 
may modulate apoptosis or tissue differentiation and would be useful in the detection, 
treatment, and/or prevention of degenerative or proliferative conditions and diseases. 
Protein, as well as, antibodies directed against the protein may show utility as a tumor 

15 marker and/or immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:70 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 

20 excluded from the scope of the present invention. To list every related sequence 

would be cumbersome. Accordingly, preferably excluded from the present invention 
are one or more polynucleotides comprising a nucleotide sequence described by the 
general formula of a-b, where a is any integer between 1 to 1882 of SEQ ID NO:70, b 
is an integer of 15 to 1896, where both a and b correspond to the positions of 

25 nucleotide residues shown in SEQ ID NO:70, and where b is greater than or equal to a 
+ 14. 



FEATURES OF PROTEIN ENCODED BY GENE NO: 61 

In specific embodiments, polypeptides of the invention comprise the following 
30 amino acid sequence: HEAALRGP (SEQ ID NO:275). Polynucleotides encoding 
these polypeptides are also encompassed by the invention. 
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It has been discovered that this gene is expressed primarily in human striatum 
depression. 

Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the tissue(s) or cell type(s) present in a biological sample and for 
5 diagnosis of the following diseases and conditions: stroke, in addition to other, 

neurologically-related diseases and/or defects. Similarly, polypeptides and antibodies 
directed to those polypeptides are useful to provide immunological probes for 
differential identification of the tissue(s) or cell type(s). For a number of disorders of 
the above tissues or cells, particularly of the central nervous system, expression of 
10 this gene at significantly higher or lower levels may be detected in certain tissues 
(e.g., neural, musculoskeletal, and cancerous and wounded tissues) or bodily fluids 
(e.g., lymph, serum, plasma, urine, synovial fluid or spinal fluid) taken from an 
individual having such a disorder, relative to the standard gene expression level, i.e., 
the expression level in healthy tissue from an individual not having the disorder. 
15 Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 

1 68 as residues: Glu-50 to Glu-61 . 

The tissue distribution in human striatum depression suggests that the protein 
product of this clone would be useful for study and treatment of central nervous 
system orders, such as seizures and other neurological conditions. The protein product 
20 of this clone is useful for the detection, treatment, and/or prevention of 

neurodegenerative disease states, behavioral disorders, or inflammatory conditions 
which include, but are not limited to Alzheimer's Disease, Parkinson's Disease, 
Huntington's Disease, Tourette Syndrome, meningitis, encephalitis, demyelinating 
diseases, peripheral neuropathies, neoplasia, trauma, congenital malformations, spinal 
25 cord injuries, ischemia and infarction, aneurysms, hemorrhages, schizophrenia, 

mania, dementia, paranoia, obsessive compulsive disorder, depression, panic disorder, 
learning disabilities, ALS, psychoses, autism, and altered behaviors, including 
disorders in feeding, sleep patterns, balance, and perception. In addition, elevated 

Expression of this gene product in regions of the brain suggests it plays a role 
30 in normal neural function. Potentially, this gene product is involved in synapse 
formation, neurotransmission, learning, cognition, homeostasis, or neuronal 
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differentiation or survival. Protein, as well as, antibodies directed against the protein 
may show utility as a tumor marker and/or immunotherapy targets for the above listed 
tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
5 available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:71 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence 
would be cumbersome. Accordingly, preferably excluded from the present invention 
10 are one or more polynucleotides comprising a nucleotide sequence described by the 
general formula of a-b, where a is any integer between 1 to 294 of SEQ ID NO:71, b 
is an integer of 15 to 308, where both a and b correspond to the positions of 
nucleotide residues shown in SEQ ID NO:71, and where b is greater than or equal to a 
+ 14. 

15 

FEATURES OF PROTEIN ENCODED BY GENE NO: 62 

This clone has homology to a cystine rich granulin peptide(s) from 
leucocyte(s) which has been termed Granulin E. Granulins inhibit keratinocytes and is 
useful topically for wound healing. The gene encoding the disclosed cDNA is 

20 believed to reside on chromosome 3. Accordingly, polynucleotides related to this 
invention are useful as a marker in linkage analysis for chromosome 3. 

It has been discovered that this gene is expressed primarily in infant brain. 
Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the tissue(s) or cell type(s) present in a biological sample and for 

25 diagnosis of the following diseases and conditions: neurological, developmental, and 
growth defects. Similarly, polypeptides and antibodies directed to those polypeptides 
are useful to provide immunological probes for differential identification of the 
tissue(s) or cell type(s). For a number of disorders of the above tissues or cells, 
particularly of the fetus and the nervous system, expression of this gene at 

30 significantly higher or lower levels may be detected in certain tissues (e.g., neural, 
developmental, growth, and cancerous and wounded tissues) or bodily fluids (e.g., 
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lymph, amniotic fluid, serum, plasma, urine, synovial fluid or spinal fluid) taken from 
an individual having such a disorder, relative to the standard gene expression level, 
i.e., the expression level in healthy tissue from an individual not having the disorder. 
Based on the strong conservation of cysteine residues, the polypeptide of the present 
5 invention can be used to inhibit keratinocytes and promote wound healing. 

The tissue distribution in infant brain suggests that the protein product of this 
clone would be useful for study and treatment of nervous system, neurodegenerative 
and developmental disorders. The protein product of this clone is useful for the 
detection, treatment, and/or prevention of neurodegenerative disease states, 

10 behavioral disorders, or inflammatory conditions which include, but are not limited to 
Alzheimer's Disease, Parkinson's Disease, Huntington's Disease, Tourette Syndrome, 
meningitis, encephalitis, demyelinating diseases, peripheral neuropathies, neoplasia, 
trauma, congenital malformations, spinal cord injuries, ischemia and infarction, 
aneurysms, hemorrhages, schizophrenia, mania, dementia, paranoia, obsessive 

15 compulsive disorder, depression, panic disorder, learning disabilities, ALS, 

psychoses, autism, and altered behaviors, including disorders in feeding, sleep 
patterns, balance, and perception. In addition, elevated 

Expression of this gene product in regions of the brain suggests it plays a role 
in normal neural function. Potentially, this gene product is involved in synapse 

20 formation, neurotransmission, learning, cognition, homeostasis, or neuronal 

differentiation or survival. Protein, as well as, antibodies directed against the protein 
may show utility as a tumor marker and/or immunotherapy targets for the above listed 
tissues. The homology to granulin proteins suggest the protein product of this clone is 
useful for the treatment, diagnosis, and/or prevention of various skin disorders 

25 including congenital disorders (i.e. nevi, moles, freckles, Mongolian spots, 

hemangiomas, port-wine syndrome), integumentary tumors (i.e. keratoses, Bowen's 
disease, basal cell carcinoma, squamous cell carcinoma, malignant melanoma, Paget' s 
disease, mycosis fungoides, and Kaposi's sarcoma), injuries and inflammation of the 
skin (i.e.wounds, rashes, prickly heat disorder, psoriasis, dermatitis), atherosclerosis, 

30 uticaria, eczema, photosensitivity, autoimmune disorders (i.e. lupus erythematosus, 
vitiligo, dermatomyositis, morphea, scleroderma, pemphigoid, and pemphigus), 
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keloids, striae, erythema, petechiae, purpura, and xanthelasma. In addition, such 
disorders may predispose increased susceptibility to viral and bacterial infections of 
the skin (i.e. cold sores, warts, chickenpox, molluscum contagiosum, herpes zoster, 
boils, cellulitis, erysipelas, impetigo, tinea, althletes foot, and ringworm). Moreover, 
5 the protein product of this clone may also be useful for the treatment or diagnosis of 
various connective tissue disorders such as arthritis, trauma, tendonitis, 
chrondomalacia and inflammation, autoimmune disorders such as rheumatoid 
arthritis, lupus, scleroderma, and dermatomyositis as well as dwarfism, spinal 
deformation, and specific joint abnormalities as well as chondrodysplasias (i.e. 

10 spondyloepiphyseal dysplasia congenita, familial osteoarthritis, Atelosteogenesis type 
II, metaphyseal chondrodysplasia type Schmid). Protein, as well as, antibodies 
directed against the protein may show utility as a tumor marker and/or 
immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 

15 available and accessible through sequence databases. Some of these sequences are 

related to SEQ ID NO:72 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence 
would be cumbersome. Accordingly, preferably excluded from the present invention 

20 are one or more polynucleotides comprising a nucleotide sequence described by the 
general formula of a-b, where a is any integer between 1 to 1674 of SEQ ID NO:72, b 
is an integer of 15 to 1688, where both a and b correspond to the positions of 
nucleotide residues shown in SEQ ID NO:72, and where b is greater than or equal to a 
+ 14. 

25 

FEATURES OF PROTEIN ENCODED BY GENE NO: 63 

In specific embodiments, polypeptides of the invention comprise the following 
amino acid sequence: SNAAGNVVRAFLYINHLKL GCKVGLA (SEQ ID NO:276). 
Polynucleotides encoding these polypeptides are also encompassed by the invention. 
30 It has been discovered that this gene is expressed primarily in prostate cancer 

and dendritic cells. 
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Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the tissue(s) or cell type(s) present in a biological sample and for 
diagnosis of the following diseases and conditions: reproductive, immune, and 
hematopoietic diseases, defects and/or disorders. Similarly, polypeptides and 
5 antibodies directed to those polypeptides are useful to provide immunological probes 
for differential identification of the tissue(s) or cell type(s). For a number of disorders 
of the above tissues or cells, particularly of the endocrine and immune systems, 
. expression of this gene at significantly higher or lower levels may be detected in 
certain tissues or cell types (e.g., reproductive, immune, hematopoietic, and cancerous 

10 and wounded tissues) or bodily fluids (e.g., lymph, seminal fluid, serum, plasma, 

urine, synovial fluid or spinal fluid) taken from an individual having such a disorder, 
relative to the standard gene expression level, i.e., the expression level in healthy 
tissue from an individual not having the disorder. 

Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 

15 170 as residues: Trp-47 to Thr-54. 

The tissue distribution in prostate cells and tissues indicates that the protein 
products of this clone are useful for study, diagnosis and treatment of neoplasias, esp. 
of the prostate, and hormonal and metabolic disorders. Moreover, the protein product 
of this clone is useful for the treatment and diagnosis of hematopoietic related 

20 disorders such as anemia, pancytopenia, leukopenia, thrombocytopenia or leukemia 
since stromal cells are important in the production of cells of hematopoietic lineages. 
The uses include bone marrow cell ex- vivo culture, bone marrow transplantation, 
bone marrow reconstitution, radiotherapy or chemotherapy of neoplasia. The gene 
product may also be involved in lymphopoiesis, therefore, it can be used in immune 

25 disorders such as infection, inflammation, allergy, immunodeficiency etc. In addition, 
this gene product may have commercial utility in the expansion of stem cells and 
committed progenitors of various blood lineages, and in the differentiation and/or 
proliferation of various cell types. Protein, as well as, antibodies directed against the 
protein may show utility as a tumor marker and/or immunotherapy targets for the 

30 above listed tissues. 
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Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:73 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence 
would be cumbersome. Accordingly, preferably excluded from the present invention 
are one or more polynucleotides comprising a nucleotide sequence described by the 
general formula of a-b, where a is any integer between 1 to 1 124 of SEQ ID NO:73, b 
is an integer of 15 to 1 138, where both a and b correspond to the positions of 
nucleotide residues shown in SEQ ID NO:73, and where b is greater than or equal to a 
+ 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 64 

In specific embodiments, polypeptides of the invention comprise the following 
amino acid sequence: NWAVLNMLLSKGKITIFLGPLECGS (SEQ ID NO:277). 
Polynucleotides encoding these polypeptides are also encompassed by the invention. 

It has been discovered that this gene is expressed primarily in B cell 
lymphoma. 

Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the tissue(s) or cell type(s) present in a biological sample and for 
diagnosis of the following diseases and conditions: immune and hematopoietic 
diseases, disorders, and/or defects, particularly cancers. Similarly, polypeptides and 
antibodies directed to those polypeptides are useful to provide immunological probes 
for differential identification of the tissue(s) or cell type(s). For a number of disorders 
of the above tissues or cells, particularly of the hemopoietic and immune systems, 
expression of this gene at significantly higher or lower levels may be detected in 
certain tissues or cell types (e.g., immune, hematopoietic, and cancerous and wounded 
tissues) or bodily fluids (e.g., lymph, serum, plasma, urine, synovial fluid or spinal 
fluid) taken from an individual having such a disorder, relative to the standard gene 
expression level, i.e., the expression level in healthy tissue from an individual not 
having the disorder. 
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The tissue distribution in B cell lymphoma suggests that the protein product of 
this clone would be useful for study and treatment of blood and immune disorders and 
neoplasias, esp. of the lymphatic system. The protein product of this clone is useful 
for the treatment and diagnosis of hematopoietic related disorders such as anemia, 
5 pancytopenia, leukopenia, thrombocytopenia or leukemia since stromal cells are 

important in the production of cells of hematopoietic lineages. The uses include bone 
marrow cell ex- vivo culture, bone marrow transplantation, bone marrow 
reconstitution, radiotherapy or chemotherapy of neoplasia. The gene product may also 
be involved in lymphopoiesis, therefore, it can be used in immune disorders such as 

10 infection, inflammation, allergy, immunodeficiency etc. In addition, this gene product 
may have commercial utility in the expansion of stem cells and committed 
progenitors of various blood lineages, and in the differentiation and/or proliferation of 
various cell types. Protein, as well as, antibodies directed against the protein may 
show utility as a tumor marker and/or immunotherapy targets for the above listed 

15 tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:74 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 

20 excluded from the scope of the present invention. To list every related sequence 

would be cumbersome. Accordingly, preferably excluded from the present invention 
are one or more polynucleotides comprising a nucleotide sequence described by the 
general formula of a-b, where a is any integer between 1 to 763 of SEQ ID NO:74, b 
is an integer of 15 to 777, where both a and b correspond to the positions of 

25 nucleotide residues shown in SEQ ID NO: 74, and where b is greater than or equal to a 
+ 14. 



30 



FEATURES OF PROTEIN ENCODED BY GENE NO: 65 

It has been discovered that this gene is expressed primarily in B cell 
lymphoma. 
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Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the tissue(s) or cell type(s) present in a biological sample and for 
diagnosis of the following diseases and conditions: immune and hematopoietic 
diseases, disorders, and/or defects, particularly cancer. Similarly, polypeptides and 
antibodies directed to those polypeptides are useful to provide immunological probes 
for differential identification of the tissue(s) or cell type(s). For a number of disorders 
of the above tissues or cells, particularly of the hemopoietic and immune systems, 
expression of this gene at significantly higher or lower levels may be detected in 
certain tissues or cell types (e.g., immune, hematopoietic, and cancerous and wounded 
tissues) or bodily fluids (e.g., lymph, serum, plasma, urine, synovial fluid or spinal 
fluid) taken from an individual having such a disorder, relative to the standard gene 
expression level, i.e., the expression level in healthy tissue from an individual not 
having the disorder. 

The tissue distribution in B cell lymphoma suggests that the protein product of 
this clone would be useful for study and treatment of neplasias, esp. of lymphatic 
organs, and immune disorders. The protein product of this clone is useful for the 
treatment and diagnosis of hematopoietic related disorders such as anemia, 
pancytopenia, leukopenia, thrombocytopenia or leukemia since stromal cells are 
important in the production of cells of hematopoietic lineages. The uses include bone 
marrow cell ex- vivo culture, bone marrow transplantation, bone marrow 
reconstitution, radiotherapy or chemotherapy of neoplasia. The gene product may also 
be involved in lymphopoiesis, therefore, it can be used in immune disorders such as 
infection, inflammation, allergy, immunodeficiency etc. In addition, this gene product 
may have commercial utility in the expansion of stem cells and committed 
progenitors of various blood lineages, and in the differentiation and/or proliferation of 
various cell types. Protein, as well as, antibodies directed against the protein may 
show utility as a tumor marker and/or immunotherapy targets for the above listed 
tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:75 and may have been publicly available prior to conception of 
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the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence 
would be cumbersome. Accordingly, preferably excluded from the present invention 
are one or more polynucleotides comprising a nucleotide sequence described by the 
5 general formula of a-b, where a is any integer between 1 to 1046 of SEQ ID NO:75, b 
is an integer of 15 to 1060, where both a and b correspond to the positions of 
nucleotide residues shown in SEQ ID NO:75, and where b is greater than or equal to a 
+ 14. 

10 FEATURES OF PROTEIN ENCODED BY GENE NO: 66 

The translation product of this gene shares sequence homology with a rat 
protein phosphatase, in addition to, a human heterogeneous nuclear ribonucleoprotein 
R (See Genbank Accession No.gil2697103 (AF000364)). When tested against PC12 
cell lines, supernatants removed from cells containing this gene activated the EGR1 

15 (early growth response gene 1) promoter element. Thus, it is likely that this gene 

activates sensory neuron cells through the EGR1 signal transduction pathway. EGR1 
is a separate signal transduction pathway from Jak-STAT, genes containing the EGR1 
promoter are induced in various tissues and cell types upon activation, leading the 
cells to undergo differentiation and proliferation. This gene also showed activity in 

20 sensory neurons using the EGR assay described in the Example section. 

In specific embodiments, polypeptides of the invention comprise the following 
amino acid sequence: PSHQTRKGKSAKLLDRPPEALRMKIITTTLLLACHLQLEV 
GVVVGGEVD (SEQ ID N 0:278), 

FQASSANNQQNWGSQPIAQQPLQQGGDYSG 

25 NYGYNNDNQEFYQDTYGQQWK (SEQ ID NO:279), WXPLLXTSGSPGLXGFG 
TRMNGKEIEGEEIEIVLAKPPDKKRKERQAARQASRSTAYEDYYYHPPPRMPP 
PIRGRGRGGGRGGYGYPPDYYGYEDYYDDYYGYDYHDYRGGYEDPYYGYD 
DGYAVRGRGGGRGGRGAPPPPRGRGAPPPRGRAGYSQRGAPLGPPRGSRGG 
RGGPAQQQRGRGSRGSRGNRGGNVGGKRKADGYNQPDSKRRQPTTNRTGV 

30 PNPSLSSRFSKVVTILVTMVTIMTTRNFIRILMGNSGSRQVRA (SEQ ID 
NO:280), RMNGKEIEGEEIEIVLAKPPDKKRKER (SEQ ID NO:281), YYHPPP 
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RMPP PIRGRGRGGGRGGYG (SEQ ID NO:282), DYRGGYEDPYYGYDDGYAV 
RGRGGGR (SEQ ID NO:283), PPPRGRAGYSQRGAPLGPPRGSRGGRGG (SEQ 
ID NO:284), and/or ADGYNQPDSK RRQPTTNRTGVPNPSLSSRFSKVVT (SEQ 
ID NO: 285). Polynucleotides encoding these polypeptides are also encompassed by 
5 the invention. The gene encoding the disclosed cDNA is believed to reside on 
chromosome 1. Accordingly, polynucleotides related to this invention are useful as 
a marker in linkage analysis for chromosome 1 . 

It has been discovered that this gene is expressed primarily in human primary 
breast cancer, lung, and leukocytes. 

10 Therefore, nucleic acids of the invention are useful as reagents for differential 

identification of the tissue(s) or cell type(s) present in a biological sample and for 
diagnosis of the following diseases and conditions: reproductive, immune, or 
pulmonary diseases and/or disorders, particularly breast cancer. Similarly, 
polypeptides and antibodies directed to those polypeptides are useful to provide 

15 immunological probes for differential identification of the tissue(s) or cell type(s). For 
a number of disorders of the above tissues or cells, particularly of the reproductive, 
immune and respiratory systems, expression of this gene at significantly higher or 
lower levels may be detected in certain tissues or cell types (e.g., reproductive, 
immune, pulmonary, and cancerous and wounded tissues) or bodily fluids (e.g., 

20 lymph, serum, plasma, urine, synovial fluid or spinal fluid) taken from an individual 
having such a disorder, relative to the standard gene expression level, i.e., the 
expression level in healthy tissue from an individual not having the disorder. 

The tissue distribution in breast cancer cells and tissues, in addition to immune 
cells, combined with the homology to a protein phosphatase suggests that the protein 

25 product of this clone would be useful for diagnosis and treatment of breast cancer and 
abnormalities of the lung and the immune system. Morever, the expression of this 
gene product suggests a role in regulating the proliferation; survival; differentiation; 
and/or activation of hematopoietic cell lineages, including blood stem cells. This gene 
product may be involved in the regulation of cytokine production, antigen 

30 presentation, or other processes suggesting a usefulness in the treatment of cancer 
(e.g. by boosting immune responses). 
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Since the gene is expressed in cells of lymphoid origin, the natural gene 
product may be involved in immune functions. Therefore it may be also used as an 
agent for immunological disorders including arthritis, asthma, immunodeficiency 
diseases such as AIDS, leukemia, rheumatoid arthritis, granulomatous disease, 
inflammatory bowel disease, sepsis, acne, neutropenia, neutrophilia, psoriasis, 
hypersensitivities, such as T-cell mediated cytotoxicity; immune reactions to 
transplanted organs and tissues, such as host-versus-graft and graft-versus-host 
diseases, or autoimmunity disorders, such as autoimmune infertility, lense tissue 
injury, demyelination, systemic lupus erythematosis, drug induced hemolytic anemia, 
rheumatoid arthritis, Sjogren's disease, scleroderma and tissues. 

Moreover, the protein may represent a secreted factor that influences the 
differentiation or behavior of other blood cells, or that recruits hematopoietic cells to 
sites of injury. In addition, this gene product may have commercial utility in the 
expansion of stem cells and committed progenitors of various blood lineages, and in 
the differentiation and/or proliferation of various cell types. The protein is useful in 
modulating the immune response to aberrant cells and cell types, particularly 
proliferative cells (e.g. protein may increase the immunogenicity of tumor antigens 
either directly or indirectly, or may activate apoptosis). The protein is useful in 
treating, detecting, and/or preventing various pulmonary disorders, which include, but 
are not limited to, ARDS, emphysema, and cystic fibrosis. Protein, as well as, 
antibodies directed against the protein may show utility as a tumor marker and/or 
immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:76 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence 
would be cumbersome. Accordingly, preferably excluded from the present invention 
are one or more polynucleotides comprising a nucleotide sequence described by the 
general formula of a-b, where a is any integer between 1 to 1489 of SEQ ID NO:76, b 
is an integer of 15 to 1503, where both a and b correspond to the positions of 
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nucleotide residues shown in SEQ ID NO:76, and where b is greater than or equal to a 
+ 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 67 
5 In specific embodiments, polypeptides of the invention comprise the following 

amino acid sequence: LQIPPSSQSLGLKNADSSI (SEQ ID NO:286), GGPPESAPW 
LPAVLRAPVLTSRCASSDSEGPVWFCQPGSGPSSTEMSCHCELGPGSSCLCVL 
RGSMWTPSVPGWPQPAKETGASSCSVFSANNGSCPLPLHNHQRQASLDTGL 
SLEHVPGESYFYSPVG (SEQ ID NO:287), SSDSEGPVWFCQPGSGPSSTEMSC 
10 HCILGPGSSC (SEQ ID NO:288), WTPSVPGWPQPAKETGASSCSVFSANNG 
(SEQ ID NO:289), and/or QRQASLDTGL SLEHVPGES YF (SEQ ID NO:290). 
Polynucleotides encoding these polypeptides are also encompassed by the invention. 

It has been discovered that this gene is expressed primarily in human B cell 
lymphoma. 

15 Therefore, nucleic acids of the invention are useful as reagents for differential 

identification of the tissue(s) or cell type(s) present in a biological sample and for 
diagnosis of the following diseases and conditions: immune or hematopoietic diseases 
and/or disorders, particularly B cell lymphoma. Similarly, polypeptides and 
antibodies directed to those polypeptides are useful to provide immunological probes 

20 for differential identification of the tissue(s) or cell type(s). For a number of disorders 
of the above tissues or cells, particularly of the immune system, expression of this 
gene at significantly higher or lower levels may be detected in certain tissues or cell 
types (e.g., immune, hematopoietic, and cancerous and wounded tissues) or bodily 
fluids (e.g., lymph, serum, plasma, urine, synovial fluid or spinal fluid) taken from an 

25 individual having such a disorder, relative to the standard gene expression level, i.e., 
the expression level in healthy tissue from an individual not having the disorder. 

The tissue distribution in B-cell lymphoma suggests that the protein product of 
this clone would be useful for diagnosis and treatment of immune or hematopoietic 
diseases and/or disorders, particularly proliferative conditions. Morever, the 

30 expression of this gene product suggests a role in regulating the proliferation; 

survival; differentiation; and/or activation of hematopoietic cell lineages, including 
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blood stem cells. This gene product may be involved in the regulation of cytokine 
production, antigen presentation, or other processes suggesting a usefulness in the 
treatment of cancer (e.g. by boosting immune responses). 

Since the gene is expressed in cells of lymphoid origin, the natural gene 
5 product may be involved in immune functions. Therefore it may be also used as an 
agent for immunological disorders including arthritis, asthma, immunodeficiency 
diseases such as AIDS, leukemia, rheumatoid arthritis, granulomatous disease, 
inflammatory bowel disease, sepsis, acne, neutropenia, neutrophilia, psoriasis, 
hypersensitivities, such as T-cell mediated cytotoxicity; immune reactions to 
10 transplanted organs and tissues, such as host-versus-graft and graft-versus-host 
diseases, or autoimmunity disorders, such as autoimmune infertility, lense tissue 
injury, demyelination, systemic lupus erythematosis, drug induced hemolytic anemia, 
rheumatoid arthritis, Sjogren's disease, scleroderma and tissues. Moreover, the protein 
may represent a secreted factor that influences the differentiation or behavior of other 
15 blood cells, or that recruits hematopoietic cells to sites of injury. In addition, this gene 
product may have commercial utility in the expansion of stem cells and committed 
progenitors of various blood lineages, and in the differentiation and/or proliferation of 
various cell types. The uses include bone marrow cell ex- vivo culture, bone marrow 
transplantation, bone marrow reconstitution, radiotherapy or chemotherapy of 
20 neoplasia. The gene product may also be involved in lymphopoiesis, therefore, it can 
be used in immune disorders such as infection, inflammation, allergy, 
immunodeficiency etc. In addition, this gene product may have commercial utility in 
the expansion of stem cells and committed progenitors of various blood lineages, and 
in the differentiation and/or proliferation of various cell types. Protein, as well as, 
25 antibodies directed against the protein may show utility as a tumor marker and/or 
immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:77 and may have been publicly available prior to conception of 
30 the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence 
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would be cumbersome. Accordingly, preferably excluded from the present invention 
are one or more polynucleotides comprising a nucleotide sequence described by the 
general formula of a-b, where a is any integer between 1 to 858 of SEQ ID NO:77, b 
is an integer of 15 to 872, where both a and b correspond to the positions of 
5 nucleotide residues shown in SEQ ID NO:77, and where b is greater than or equal to a 
+ 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 68 

In specific embodiments, polypeptides of the invention comprise the following 
10 amino acid sequence: SSSLVLTIRSQTLFLASFIHSTSIFCALN (SEQ ID NO:291). 
Polynucleotides encoding these polypeptides are also encompassed by the invention. 

It has been discovered that this gene is expressed primarily in osteoarthritic 
cartilage. 

Therefore, nucleic acids of the invention are useful as reagents for differential 
15 identification of the tissue(s) or cell type(s) present in a biological sample and for 
diagnosis of the following diseases and conditions: osteoarthritis and other 
bone/cartilage disorders, particularly degenerative conditions. Similarly, polypeptides 
and antibodies directed to those polypeptides are useful to provide immunological 
probes for differential identification of these tissue(s) or cell type(s). For a number of 
20 disorders of the above tissues or cells, particularly of the skelatal system, expression 
of this gene at significantly higher or lower levels may be detected in certain tissues 
or cell types (e.g., skeletal, joint, autoimmune, and cancerous and wounded tissues) or 
bodily fluids (e.g., lymph, serum, plasma, urine, synovial fluid or spinal fluid) taken 
from an individual having such a disorder, relative to the standard gene expression 
25 level, i.e., the expression level in healthy tissue from an individual not having the 
disorder. 

The tissue distribution in osteoarthritic cartilage suggests that the protein 
product of this clone would be useful for the diagnosis, treatment, and/or prevention 
of osteoarthritis. Moreover, the gene product is useful in the detection and treatment 
30 of disorders and conditions affecting the skeletal system, in particular osteoporosis, 
bone cancer, as well as, disorders afflicting connective tissues (e.g. arthritis, trauma, 
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tendonitis, chrondomalacia and inflammation), such as in the diagnosis or treatment 
of various autoimmune disorders such as rheumatoid arthritis, lupus, scleroderma, and 
dermatomyositis as well as dwarfism, spinal deformation, and specific joint 
abnormalities as well as chondrodysplasias (i.e. spondyloepiphyseal dysplasia 
5 congenita, familial osteoarthritis, Atelosteogenesis type II, metaphyseal 

chondrodysplasia type Schmid). Protein, as well as, antibodies directed against the 
protein may show utility as a tumor marker and/or immunotherapy targets for the 
above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 

10 available and accessible through sequence databases. Some of these sequences are 

related to SEQ ID NO:78 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence 
would be cumbersome. Accordingly, preferably excluded from the present invention 

15 are one or more polynucleotides comprising a nucleotide sequence described by the 
general formula of a-b, where a is any integer between 1 to 559 of SEQ ID NO:78, b 
is an integer of 15 to 573, where both a and b correspond to the positions of 
nucleotide residues shown in SEQ ID NO:78, and where b is greater than or equal to a 
+ 14. 

20 

FEATURES OF PROTEIN ENCODED BY GENE NO: 69 

The gene encoding the disclosed cDNA is believed to reside on chromosome 
17. Accordingly, polynucleotides related to this invention are useful as a marker in 
linkage analysis for chromosome 17. 

25 It has been discovered that this gene is expressed primarily in fetal brain, 

pharynx carcinoma, and Hodgkin's lymphoma. 

Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the tissue(s) or cell type(s) present in a biological sample and for 
diagnosis of the following diseases and conditions: developmental and/or proliferative 

30 diseases and disorders, particularly pharynx carcinoma, and Hodgkin's lymphoma. 
Similarly, polypeptides and antibodies directed to those polypeptides are useful to 
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provide immunological probes for differential identification of the tissue(s) or cell 
type(s). For a number of disorders of the above tissues or cells, particularly of the 
digestive and immune systems, expression of this gene at significantly higher or 
lower levels may be detected in certain tissues or cell types (e.g., developmental, 
proliferative cancerous and wounded tissues) or bodily fluids (e.g., lymph, serum, 
plasma, urine, amniotic fluid, synovial fluid or spinal fluid) taken from an individual 
having such a disorder, relative to the standard gene expression level, i.e., the 
expression level in healthy tissue from an individual not having the disorder. 

Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 
176 as residues: Tyr-30 to Ser-40. 

The tissue distribution in pharynx carcinoma and Hodgkin's lymphoma 
suggests that the protein product of this clone would be useful for diagnosis and 
treatment of immune and proliferative conditions. Moreover, expression within fetal 
tissue and other cellular sources marked by proliferating cells suggests this protein 
may play a role in the regulation of cellular division, and may show utility in the 
diagnosis and treatment of cancer and other proliferative disorders. Similarly, 
developmental tissues rely on decisions involving cell differentiation and/or apoptosis 
in pattern formation. Dysregulation of apoptosis can result in inappropriate 
suppression of cell death, as occurs in the development of some cancers, or in failure 
to control the extent of cell death, as is believed to occur in acquired 
immunodeficiency and certain neurodegenerative disorders, such as spinal muscular 
atrophy (SMA). Therefore, the polynucleotides and polypeptides of the present 
invention are useful in treating, detecting, and/or preventing said disorders and 
conditions, in addition to other types of degenerative conditions. Thus this protein 
may modulate apoptosis or tissue differentiation and would be useful in the detection, 
treatment, and/or prevention of degenerative or proliferative conditions and diseases. 

Alternatively, the protein product of this clone is useful for the detection, 
treatment, and/or prevention of neurodegenerative disease states, behavioral 
disorders, or inflammatory conditions which include, but are not limited to 
Alzheimer's Disease, Parkinson's Disease, Huntington's Disease, Tourette Syndrome, 
meningitis, encephalitis, demyelinating diseases, peripheral neuropathies, neoplasia, 
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trauma, congenital malformations, spinal cord injuries, ischemia and infarction, 
aneurysms, hemorrhages, schizophrenia, mania, dementia, paranoia, obsessive 
compulsive disorder, depression, panic disorder, learning disabilities, ALS, 
psychoses, autism, and altered behaviors, including disorders in feeding, sleep 
5 patterns, balance, and perception. In addition, elevated 

Expression of this gene product in regions of the brain suggests it plays a role 
in normal neural function. Potentially, this gene product is involved in synapse 
formation, neurotransmission, learning, cognition, homeostasis, or neuronal 
differentiation or survival. Protein, as well as, antibodies directed against the protein 
10 may show utility as a tumor marker and/or immunotherapy targets for the above listed 
tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:79 and may have been publicly available prior to conception of 

15 the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence 
would be cumbersome. Accordingly, preferably excluded from the present invention 
are one or more polynucleotides comprising a nucleotide sequence described by the 
general formula of a-b, where a is any integer between 1 to 1495 of SEQ ID NO:79, b 

20 is an integer of 15 to 1509, where both a and b correspond to the positions of 

nucleotide residues shown in SEQ ID NO:79, and where b is greater than or equal to a 
+ 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 70 
25 The translation product of this gene shares sequence homology with insulin- 

like growth factor binding protein. Moreover, the protein has homology to the human 
Slit- 1 protein (See Genbank Accession No. gnllPIDId 1036 170 (AB017167)), which is 
thought to play an integral role in neural development. In Drosophila embryogenesis, 
the slit gene has been shown to play a critical role in CNS midline formation. Each 
30 Slit gene encodes a putative secreted protein, which contains conserved protein- 
protein interaction domains including leucine-rich repeats (LRR) and epidermal 
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growth factor (EGF)-like motifs, like that of the Drosophila protein. The Slit genes 
form an evolutionary conserved group in vertebrates and invertebrates, and the 
mammalian Slit proteins may participate in the formation and maintenance of the 
nervous and endocrine systems by protein-protein interactions. 

In specific embodiments, polypeptides of the invention comprise the following 
amino acid sequence: the EGF-like domain: CCCRLGLSGPKC (SEQ ID NO:292); in 
addition to the following: RAFWGLGALQLLDLSANQLEAL (SEQ ID NO:293), 
HASGRRTGSADDGLQGRTGSGPPTAGAGGGGAAP (SEQ ID NO:294), 
VSAAAGARLAPRAPGAPAGCRPMRGCAARAAARKSLVPVLPAGWRSGPAA 
AARPGPRRLAHAPSAARSRAGPGAVARPLPRRHLAAAHGRGCGPAAARAGA 
GSGPGARRAARVPTAGRPPGTHVHTSGQSGAPRDPEGEALADTWAQTGQGD 
SSSNSSSSGRGRDQEGPRMGAAPPPPAPAVGGPLPVRPWSPSSAEPVLRPDAW 
(SEQ ID NO:295), 

TRPAAERAPRTTGSRDAQAAGLPPRVPGAGGLPPCGALPGR 
GLGRCCCCCCCCRLGLSGPKCRPGPRPRGPWAPRTAPRCARACREACQLSAL 
SLPAVPPGLSLRLRALLLDHNRVRALPPGAFAGAGALQRLDLRENGLHSVHV 
RAFWGLGALQLLDLSANQLEALAPGTFAPLRALRNLSLAGNRLARLEPAALG 
ALPLLRSLSLQDNELAALAPGLLGRLPALDALHLRGNPWGCGCALRPLCAWL 
RRHPLPASEAETVLCVWPGRLTLSPLTAFSDAAFSHCAQPLALRDLARGLHA 
RAGLLPRQPGFLPGAGLWAHRLPCAPPPPPHRRPPPAETVQTRTPIPTPTAVPR 
PRTRG APS A A AQ A (SEQ ID NO:296), 

GCRPMRGCAARAAARKSLVPVLPAGWRSGP AAAARPGPRRLAHAPSA (SEQ 
ID NO:297), PGAVARPLPRRHLAAAHGRGCG PAAARAGA (SEQ ID NO:298), 
SGQSGAPRDPEGEALADTWAQTGQ (SEQ ID NO:299), 
PPAPAVGGPLPVRPWSPSSAEPV (SEQ ID NO:300), APRTTGSRD 
AQAAGLPPRVPGAGGLP (SEQ ID NO:301), GPRPRGPWAPRTAPRCARACRE 
(SEQ ID NO:302), AVPPGLSLRLRALLLDHNRVRALPPGAFAGA (SEQ ID 
NO:303), LG ALQLLDLS ANQLE AL APGTF AP (SEQ ID NO: 304), PPGAFAGAG 
ALQRLDLRENGLHSVHVRAFWGLGALQ (SEQ ID NO:305), RNLSLAGNRLA 
RLEP AALGALPLLRSLS (SEQ ID NO:306), LPALDALHLRGNPWGCGCALRP 
LCAW (SEQ ID NO:307), TVLCVWPGRLTLSPLTAFSDAAFSHCAQPLALRD 
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(SEQ ID NO:308), LHARAGLLPRQPGFLPGAGLWAHR (SEQ ID NO:309), 
and/or TVQTRTPIPTPTAVPRPRTRGAPS (SEQ ID NO:310). Polynucleotides 
encoding these polypeptides are also encompassed by the invention. 

It has been discovered that this gene is expressed primarily in a breast cancer 
5 cell line, MDA36. 

Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the tissue(s) or cell type(s) present in a biological sample and for 
diagnosis of the following diseases and conditions: neural, reproductive, and 
proliferative diseases and/or disorders, particularly breast cancer and degenerative 

10 conditions. Similarly, polypeptides and antibodies directed to those polypeptides are 
useful to provide immunological probes for differential identification of the tissue(s) 
or cell type(s). For a number of disorders of the above tissues or cells, particularly of 
the reproductive system, expression of this gene at significantly higher or lower levels 
may be detected in certain tissues or cell types (e.g., neural, reproductive, and 

15 proliferative cancerous and wounded tissues) or bodily fluids (e.g., lymph, serum, 
plasma, urine, synovial fluid or spinal fluid) taken from an individual having such a 
disorder, relative to the standard gene expression level, i.e., the expression level in 
healthy tissue from an individual not having the disorder. 

Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 

20 177 as residues: Met-1 to Arg-10, Arg-64 to Ala-71, Gly-124 to Gly-131, Pro- 189 to 
Arg-194, Val-223 to Gly-228. 

The tissue distribution in a breast cancer cells and tissues and homology to 
insulin-like growth factor binding protien suggests that the protein product of this 
clone would be useful for diagnosis and treatment of breast cancer, and other forms of 

25 cancer. Moreover, the homology to the conserved human slit- 1 protein suggests that 
the protein is useful in the treatment, diagnosis, and/or prevention of neural disorders, 
particularly developmental and degenerative conditions. Similarly, the protein is 
useful for the treatment and/or diagnosis of neurodegenerative disease states, 
behavioral disorders, or inflammatory conditions which include, but are not limited to 

30 Alzheimer's Disease, Parkinson's Disease, Huntington's Disease, Tourette Syndrome, 
meningitis, encephalitis, demyelinating diseases, peripheral neuropathies, neoplasia, 
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trauma, congenital malformations, spinal cord injuries, ischemia and infarction, 
aneurysms, hemorrhages, schizophrenia, mania, dementia, paranoia, obsessive 
compulsive disorder, depression, panic disorder, learning disabilities, ALS, 
psychoses, autism, and altered behaviors, including disorders in feeding, sleep 
5 patterns, balance, and perception. In addition, elevated 

Expression of this gene product in regions of the brain suggests it plays a role 
in normal neural function. Potentially, this gene product is involved in synapse 
formation, neurotransmission, learning, cognition, homeostasis, or neuronal 
differentiation or survival. Protein, as well as, antibodies directed against the protein 
10 may show utility as a tumor marker and/or immunotherapy targets for the above listed 
tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:80 and may have been publicly available prior to conception of 

15 the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence 
would be cumbersome. Accordingly, preferably excluded from the present invention 
are one or more polynucleotides comprising a nucleotide sequence described by the 
general formula of a-b, where a is any integer between 1 to 1095 of SEQ ID NO:80, b 

20 is an integer of 15 to 1 109, where both a and b correspond to the positions of 

nucleotide residues shown in SEQ ID NO:80, and where b is greater than or equal to a 
+ 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 71 

25 In specific embodiments, polypeptides of the invention comprise the following 

amino acid sequence: HASGRPDRSSAPIGNSGLPCPDLEPLGGLQSKCRLCAPTE 
ARGLWSRSLCSDRCDTWRS (SEQ ID NO:3 1 1 ), and/or GLPCPDLEPLGGLQSK 
CRLCAPTEARGLW (SEQ ID NO:312). Polynucleotides encoding these 
polypeptides are also encompassed by the invention. This gene also maps to 

30 chromosome 1, and therefore can be used in linkage analysis as a marker for 
chromosome 1. 
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It has been discovered that this gene is expressed primarily in salivary gland 
and colon carcinoma. 

Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the tissue(s) or cell type(s) present in a biological sample and for 
5 diagnosis of the following diseases and conditions: colon carcinoma and other 

digestive system or gastrointestinal diseases and/or disorders. Similarly, polypeptides 
and antibodies directed to those polypeptides are useful to provide immunological 
probes for differential identification of the tissue(s) or cell type(s). For a number of 
disorders of the above tissues or cells, particularly of the digestive system, expression 
10 of this gene at significantly higher or lower levels may be detected in certain tissues 
or cell types (e.g., digestive system, gastrointestinal, metabolic, and cancerous and 
wounded tissues) or bodily fluids (e.g., lymph, serum, plasma, urine, chyme, bile, 
synovial fluid or spinal fluid) taken from an individual having such a disorder, 
relative to the standard gene expression level, i.e., the expression level in healthy 
15 tissue from an individual not having the disorder. 

Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 
178 as residues: Val-34 to Leu-39, Ser-64 to Cys-74, Ser-86 to Ser-95, Arg-128 to 
Ala- 136. 

The tissue distribution in salivary gland and colon carcinoma suggests that the 
20 protein product of this clone would be useful for the treatment and diagnosis colon 
cancer and other digestive system diseases and/or disorders, such as ulcers, and other 
proliferative conditions. Protein, as well as, antibodies directed against the protein 
may show utility as a tumor marker and/or immunotherapy targets for the above listed 
tissues. 

25 Many polynucleotide sequences, such as EST sequences, are publicly 

available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:81 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence 

30 would be cumbersome. Accordingly, preferably excluded from the present invention 
are one or more polynucleotides comprising a nucleotide sequence described by the 
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general formula of a-b, where a is any integer between 1 to 793 of SEQ ID NO:81, b 
is an integer of 15 to 807, where both a and b correspond to the positions of 
nucleotide residues shown in SEQ ID NO:81, and where b is greater than or equal to a 
+ 14. 

5 

FEATURES OF PROTEIN ENCODED BY GENE NO: 72 

In specific embodiments, polypeptides of the invention comprise the following 
amino acid sequence: QEWESELGERRKPLQA (SEQ ID NO:313). Polynucleotides 
encoding these polypeptides are also encompassed by the invention. 
10 It has been discovered that this gene is expressed primarily in 6 week old 

human embryos. 

Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the tissue(s) or cell type(s) present in a biological sample and for 
diagnosis of the following diseases and conditions: embryological defects; aberrant 

15 development; aberrant cellular proliferation (e.g. cancers), and other developmentally 
related or proliferative diseases and/or disorders. Similarly, polypeptides and 
antibodies directed to those polypeptides are useful to provide immunological probes 
for differential identification of the tissue(s) or cell type(s). For a number of disorders 
of the above tissues or cells, particularly of the developing human embryo, expression 

20 of this gene at significantly higher or lower levels may be detected in certain tissues 
or cell types (e.g., developmental, and cancerous and wounded tissues) or bodily 
fluids (e.g., lymph, serum, plasma, amniotic fluid, urine, synovial fluid or spinal fluid) 
taken from an individual having such a disorder, relative to the standard gene 
expression level, i.e., the expression level in healthy tissue from an individual not 

25 having the disorder. 

The tissue distribution in 6 week old human embryos suggests that the protein 
product of this clone would be useful for the diagnosis and/or treatment of defects in 
embryonic development. Elevated expression of this gene product in early 6 week 
human embryos suggests that this gene product plays a critical role in normal human 

30 development. Alternatively, this gene product may be involved in the pattern of 
cellular proliferation that accompanies early embryogenesis. Thus, aberrant 
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Expression of this gene product in tissues - particularly adult tissues - may 
correlate with patterns of abnormal cellular proliferation, such as found in various 
cancers. Moreover, this protein may play a role in the regulation of cellular division, 
and may show utility in the diagnosis and treatment of cancer and other proliferative 
disorders. Similarly, developmental tissues rely on decisions involving cell 
differentiation and/or apoptosis in pattern formation. Dysregulation of apoptosis can 
result in inappropriate suppression of cell death, as occurs in the development of some 
cancers, or in failure to control the extent of cell death, as is believed to occur in 
acquired immunodeficiency and certain neurodegenerative disorders, such as spinal 
muscular atrophy (SMA). Therefore, the polynucleotides and polypeptides of the 
present invention are useful in treating, detecting, and/or preventing said disorders 
and conditions, in addition to other types of degenerative conditions. Thus this protein 
may modulate apoptosis or tissue differentiation and would be useful in the detection, 
treatment, and/or prevention of degenerative or proliferative conditions and diseases. 
Protein, as well as, antibodies directed against the protein may show utility as a tumor 
marker and/or immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:82 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence 
would be cumbersome. Accordingly, preferably excluded from the present invention 
are one or more polynucleotides comprising a nucleotide sequence described by the 
general formula of a-b, where a is any integer between 1 to 1029 of SEQ ID NO: 82, b 
is an integer of 15 to 1043, where both a and b correspond to the positions of 
nucleotide residues shown in SEQ ID NO:82, and where b is greater than or equal to a 
+ 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 73 

In specific embodiments, polypeptides of the invention comprise the following 
amino acid sequence: CQSSNLIFFQFVNILFNLMMDILVDFSITKMPINSIFSLYF 
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CYEII (SEQ ID NO:314). Polynucleotides encoding these polypeptides are also 
encompassed by the invention. 

It has been discovered that this gene is expressed primarily in 6 week old 
human embryo. 

5 Therefore, nucleic acids of the invention are useful as reagents for differential 

identification of the tissue(s) or cell type(s) present in a biological sample and for 
diagnosis of the following diseases and conditions: abnormal embryonic 
development; abnormal cellular proliferation; developmental defects, and other 
developmentally related or proliferative diseases and/or conditions. Similarly, 

10 polypeptides and antibodies directed to those polypeptides are useful to provide 

immunological probes for differential identification of the tissue(s) or cell type(s). For 
a number of disorders of the above tissues or cells, particularly of the developing 
human embryo, expression of this gene at significantly higher or lower levels may be 
detected in certain tissues or cell types (e.g., developmental, and cancerous and 

15 wounded tissues) or bodily fluids (e.g., lymph, serum, plasma, amniotic fluid, urine, 
synovial fluid or spinal fluid) taken from an individual having such a disorder, 
relative to the standard gene expression level, i.e., the expression level in healthy 
tissue from an individual not having the disorder. 

The tissue distribution in 6 week old human embryo suggests that the protein 

20 product of this clone would be useful for the diagnosis and treatment of disorders of 
human embryonic development. Expression of this clone in developing embryos 
suggests that it plays a critical role in early human development. Alternatively, it may 
be involved in key cellular proliferation events that occur during embryogenesis. 
Therefore misexpression of this gene in adult tissues may lead to abnormal patterns of 

25 cellular proliferation and cancer. Moreover, expression within embryonic tissue and 
other cellular sources marked by proliferating cells suggests this protein may play a 
role in the regulation of cellular division, and may show utility in the diagnosis and 
treatment of cancer and other proliferative disorders. Similarly, developmental tissues 
rely on decisions involving cell differentiation and/or apoptosis in pattern formation. 

30 Dysregulation of apoptosis can result in inappropriate suppression of cell death, as 
occurs in the development of some cancers, or in failure to control the extent of cell 
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death, as is believed to occur in acquired immunodeficiency and certain 
neurodegenerative disorders, such as spinal muscular atrophy (SMA). Therefore, the 
polynucleotides and polypeptides of the present invention are useful in treating, 
detecting, and/or preventing said disorders and conditions, in addition to other types 
5 of degenerative conditions. Thus this protein may modulate apoptosis or tissue 

differentiation and would be useful in the detection, treatment, and/or prevention of 
degenerative or proliferative conditions and diseases. Protein, as well as, antibodies 
directed against the protein may show utility as a tumor marker and/or 
immunotherapy targets for the above listed tissues. 

10 Many polynucleotide sequences, such as EST sequences, are publicly 

available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO: 83 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence 

15 would be cumbersome. Accordingly, preferably excluded from the present invention 
are one or more polynucleotides comprising a nucleotide sequence described by the 
general formula of a-b, where a is any integer between 1 to 1 159 of SEQ ID NO:83, b 
is an integer of 15 to 1 173, where both a and b correspond to the positions of 
nucleotide residues shown in SEQ ID NO:83, and where b is greater than or equal to a 

20 + 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 74 

In specific embodiments, polypeptides of the invention comprise the following 
amino acid sequence: GPVWLFCFLTLCRKPSQLFSQENSCMDVAGGVTTCLPP 

25 WFSRGAPAQMSQWPPSSDHGAVRAGRDSRVGPVQPSHLTCEGGKEEREKNK 
KAEVNPPTGMGLANRIPRDDITLKLRNQGKLRTKENRTQSAKRHP (SEQ ID 
NO:315), VACKPENRTKTHFASSPACDGHALGGQVGFAICFLSCLFPPM (SEQ 
ID NO:316), and/or SHPMPNTPQKQLLFSEDNELLVSLRTGRKPTLQAALRVTG 
(SEQ ID NO:317). Polynucleotides encoding these polypeptides are also 

30 encompassed by the invention. 
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It has been discovered that this gene is expressed primarily in pleural cancer 
and endometrial tumors, and, to a lesser extent, in bone marrow & apoptotic T cells. 

Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the tissue(s) or cell type(s) present in a biological sample and for 
diagnosis of the following diseases and conditions: pleural cancer; endometrial 
tumors; hematopoietic disorders; immune dysfunction. Similarly, polypeptides and 
antibodies directed to those polypeptides are useful to provide immunological probes 
for differential identification of the tissue(s) or cell type(s). For a number of disorders 
of the above tissues or cells, particularly of the lungs and immune system, expression 
of this gene at significantly higher or lower levels may be detected in certain tissues 
or cell types (e.g., immune, hematopoietic, reproductive, and cancerous and wounded 
tissues) or bodily fluids (e.g., lymph, serum, plasma, urine, synovial fluid or spinal 
fluid) taken from an individual having such a disorder, relative to the standard gene 
expression level, i.e., the expression level in healthy tissue from an individual not 
having the disorder. 

The tissue distribution in pleural cancer and endometrial tumors indicates that 
the protein products of this clone are useful for the diagnosis and treatment of various 
reproductive cancers, including pleural cancer and endometrial tumors. In addition, 

Expression of this gene product within T cells & bone marrow suggests that it 
may play a role in normal hematopoiesis. Therefore, this gene product may also be 
useful in the diagnosis and/or treatment of a variety of hematopoietic disorders, 
including defects in immune surveillance, inflammation, impaired immune function, 
and T cell lymphomas. Use of this gene product may be appropriate in situations 
designed to affect the proliferation, survival, and/or differentiation of various 
hematopoietic cell lineages, including blood stem cells. 

Moreover, this protein may play a role in the regulation of cellular division, 
and may show utility in the diagnosis and treatment of cancer and other proliferative 
disorders. Similarly, developmental tissues rely on decisions involving cell 
differentiation and/or apoptosis in pattern formation. Dysregulation of apoptosis can 
result in inappropriate suppression of cell death, as occurs in the development of some 
cancers, or in failure to control the extent of cell death, as is believed to occur in 
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acquired immunodeficiency and certain neurodegenerative disorders, such as spinal 
muscular atrophy (SMA). Therefore, the polynucleotides and polypeptides of the 
present invention are useful in treating, detecting, and/or preventing said disorders 
and conditions, in addition to other types of degenerative conditions. Thus this protein 
5 may modulate apoptosis or tissue differentiation and would be useful in the detection, 
treatment, and/or prevention of degenerative or proliferative conditions and diseases. 
Protein, as well as, antibodies directed against the protein may show utility as a tumor 
marker and/or immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 

10 available and accessible through sequence databases. Some of these sequences are 

related to SEQ ID NO: 84 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence 
would be cumbersome. Accordingly, preferably excluded from the present invention 

15 are one or more polynucleotides comprising a nucleotide sequence described by the 
general formula of a-b, where a is any integer between 1 to 1 547 of SEQ ID NO: 84, b 
is an integer of 15 to 1561, where both a and b correspond to the positions of 
nucleotide residues shown in SEQ ID NO: 84, and where b is greater than or equal to a 
+ 14. 

20 

FEATURES OF PROTEIN ENCODED BY GENE NO: 75 

The translation product of this gene shares low sequence homology with dreg- 
2, a gene product originally identified in Drosophila that shows an oscillating pattern 
of expression tied into a circadian clock rhythm. 

25 In specific embodiments, polypeptides of the invention comprise the following 

amino acid sequence: 

AHRLQIRLLTWDVKDTLLRLRHPLGEAYATKARAHGLEV 
EPSALEQGFRQAYRAQSHSFPNYGLSHGLTSRQWWLDVVLQTFHLAGVQDA 
QAVAPIAEQLYKDFSHPCTWQVLDGAEDTLRECRTRGLRLAVISNFDRRLEGI 

30 LXGLGLREHFDFVLTSEAAGWPKPDPRIFQEALRLAHMEPVVAAHVGDNYL 
CDYQGPRAVGMHSFLVVGPQALDPVVRDSVPKEHILPSLAHLLPALDCLEGS 
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TPGL (SEQ ID N 0:319), 

EGDPRGRPRPRPLGPPPQLTLPTALXDILRQVRAPGLRLSRA 
LEVGRKGSPIFKIQIYL (SEQ ID NO:318), IRLLTWDVKDTLLRLRHPLGEAYA 
TKA (SEQ ID NO:320), LEQGFRQAYRAQSHSFPNYGLSHG (SEQ ID NO:321), 
5 HLAGVQDAQAVAPIAEQLYKDFSHPC (SEQ ID NO:322), VLDGAEDTLRECR 
TRGLRLAVIS (SEQ ID NO:323), REHFDFVLTSEAAGWPKPDPRIFQEA (SEQ 
ID NO:324), EPVVAAHVGDNYLCDYQGPRAVGMHSFL (SEQ ID NO:325), 
and/or VVRDSVPKEHILPSLAHLLPALD (SEQ ID NO:326). Polynucleotides 
encoding these polypeptides are also encompassed by the invention. 
10 K has been discovered that this gene is expressed primarily in tumors of the 

pancreas & thymus and to a lesser extent in a variety of fetal tissues, including fetal 
brain, liver, spleen, and kidney. 

Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the tissue(s) or cell type(s) present in a biological sample and for 
15 diagnosis of the following diseases and conditions: pancreatic cancer; thymic cancer; 
disorders of fetal development; abnormal cellular proliferation; hematopoietic 
disorders. Similarly, polypeptides and antibodies directed to those polypeptides are 
useful to provide immunological probes for differential identification of the tissue(s) 
or cell type(s). For a number of disorders of the above tissues or cells, particularly of 
20 the pancreas and immune system, expression of this gene at significantly higher or 
lower levels may be detected in certain tissues or cell types (e.g., developmental, 
metabolic, immune, hematopoietic, and cancerous and wounded tissues) or bodily 
fluids (e.g., lymph, serum, plasma, amniotic fluid, urine, synovial fluid or spinal fluid) 
taken from an individual having such a disorder, relative to the standard gene 
25 expression level, i.e., the expression level in healthy tissue from an individual not 
having the disorder. 

The tissue distribution in proliferative and developmental cells and tissues 
indicates that the protein products of this clone are useful for the diagnosis and 
treatment of cancers, particularly pancreatic and thymic cancer. Expression of this 
30 gene product within various fetal tissues also indicates that it is useful in the diagnosis 
and/or treatment of human developmental disorders. Taken together, the observation 
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that this gene product is expressed in cancers and in fetal tissues indicates that it plays 
a role in proliferation and/or differentiation events that are associated with early 
development. Misexpression of this gene product in adult tissues, therefore, may 
directly contribute to abnormal cellular proliferation and/or dedifferentiation that 
5 accompanies cancer. Finally, 

Moreover, the expression of this gene product in fetal liver/spleen also 
suggests that it plays a role in hematopoiesis, and is useful in the diagnosis and/or 
treatment of a variety of disorders of the immune system. Protein, as well as, 
antibodies directed against the protein may show utility as a tumor marker and/or 

10 immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:85 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 

15 excluded from the scope of the present invention. To list every related sequence 

would be cumbersome. Accordingly, preferably excluded from the present invention 
are one or more polynucleotides comprising a nucleotide sequence described by the 
general formula of a-b, where a is any integer between 1 to 1419 of SEQ ID NO:85, b 
is an integer of 15 to 1433, where both a and b correspond to the positions of 

20 nucleotide residues shown in SEQ ID NO:85, and where b is greater than or equal to a 
+ 14. 



FEATURES OF PROTEIN ENCODED BY GENE NO: 76 

In specific embodiments, polypeptides of the invention comprise the following 
25 amino acid sequence: IRKLGPGLAPCSCRSGQVFPRV (SEQ ID NO:327). 

Polynucleotides encoding these polypeptides are also encompassed by the invention. 

It has been discovered that this gene is expressed primarily in frontal cortex, 
particularly derived from epileptic patients. 

Therefore, nucleic acids of the invention are useful as reagents for differential 
30 identification of the tissue(s) or cell type(s) present in a biological sample and for 
diagnosis of the following diseases and conditions: epilepsy; neurodegenerative 
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diseases and disorders, particularly learning disorders. Similarly, polypeptides and 
antibodies directed to those polypeptides are useful to provide immunological probes 
for differential identification of the tissue(s) or cell type(s). For a number of disorders 
of the above tissues or cells, particularly of the brain, CNS, and/or PNS, expression of 
5 this gene at significantly higher or lower levels may be detected in certain tissues or 
cell types (e.g., neural, and cancerous and wounded tissues) or bodily fluids (e.g., 
lymph, serum, plasma, urine, synovial fluid or spinal fluid) taken from an individual 
having such a disorder, relative to the standard gene expression level, i.e., the 
expression level in healthy tissue from an individual not having the disorder. 

10 The tissue distribution in frontal cortex tissue suggests that the protein product 

of this clone would be useful for the diagnosis and/or treatment of disorders of the 
brain and nervous system, particularly epilepsy. Moreover, the expression of this gene 
product suggests that it may play a role in various critical processes of the nervous 
system, including nerve survival, pathfinding, signal conductance, and/or synapse 

15 formation. It may have effects on various processes including homeostasis, learning, 
motor function, language, etc. Potentially, this gene product is involved in synapse 
formation, neurotransmission, learning, cognition, homeostasis, or neuronal 
differentiation or survival. Protein, as well as, antibodies directed against the protein 
may show utility as a tumor marker and/or immunotherapy targets for the above listed 
20 tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO: 86 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 

25 excluded from the scope of the present invention. To list every related sequence 

would be cumbersome. Accordingly, preferably excluded from the present invention 
are one or more polynucleotides comprising a nucleotide sequence described by the 
general formula of a-b, where a is any integer between 1 to 1363 of SEQ ID NO:86, b 
is an integer of 15 to 1377, where both a and b correspond to the positions of 

30 nucleotide residues shown in SEQ ID NO:86, and where b is greater than or equal to a 
+ 14. 
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FEATURES OF PROTEIN ENCODED BY GENE NO: 77 

In specific embodiments, polypeptides of the invention comprise the following 
amino acid sequence: 

5 KPLRMARPGGPEHNEYALVSAWHSSGSYLDSEGLRHQDD 

FDVSLLVCHCAAPFEEQGEAERHVLRLQFFVVLTSQRELFPRLTADMRRFRK 
PPRLPPEPEAPGSSAGSPGEASGLILAPGPAPLFPPLAAEVGMARARLAQLVRL 
AGGHCRRDTLWKRLFLLEPPGPDRLRLGGRLALAELEELLEAVHAKSIGDIDP 
QLDCFLSMTVSWYQSLIKVLLSRFPRAVAISKAQTWELSTWLR (SEQ ID 
10 NO:328), ARGTLELPTPLIAAHQLYNYVADHASSYHM (SEQ ID NO:329), 
SHCEWPGQG AQNTTSMPWCRHGTVLAPTWTLRDFDTR (SEQ ID NO: 330), 
PLTTVSHLCPL 

SLRVFTSHLDITAGHSHRDDTWVPIPALPLKHLRPPSSPFALGPWVSHPLMRW 
VQKLSHLHSNPGTGFSMGGKSAEKLKC (SEQ ID NO:331), STAARGAPGPGR 

1 5 AGGTPRSSPCQIHWGHRPPAGLLPIHDGLLVPEPDQSSPKPLPQSCRHFQSPDL 
GTQYLVALNQKFTDCSALVFWTPLRKDVSEVVFREALPVQPQDTRSPPAQLV 
STYHHLESVINTACFTLLDPPPLKGVDWTTECHCSLNHGPTRLPARGRTDQPF 
WAPGQARH (SEQ ID NO:332), 

HQRLCNYVLRVCCPSLAAGTALPKHPQPLTHPGL 

20 QRVRSTPRTPWALLGYSFRPPW (SEQ ID NO:333), 
PGGPEHNEYALVS AWHSS GSYLDSEGLR (SEQ ID NO:334), 
D VSLL VCHC A APFEEQGE AERHVLR (SEQ ID NO:335), 
RLTADMRRFRKPPRLPPEPEAPGSSAGS (SEQ ID NO:336), GEASGLI 
LAPGP APLFPPLA AEVGM (SEQ ID NO:3 37), 

25 TLWKRLFLLEPPGPDRLRLGGRL (SEQ ID NO:338), and/or 
LAELEELLEAVHAKSIGDIDPQLDCFLS (SEQ ID NO:339). Polynucleotides 
encoding these polypeptides are also encompassed by the invention. 

It has been discovered that this gene is expressed primarily in fetal liver/spleen 
and leukocytes, and to a lesser extent in a colon adenocarcinoma cell line. 

30 Therefore, nucleic acids of the invention are useful as reagents for differential 

identification of the tissue(s) or cell type(s) present in a biological sample and for 
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diagnosis of the following diseases and conditions: hematopoietic disorders; immune 
dysfunction; colon cancer; colorectal adenocarcinoma. Similarly, polypeptides and 
antibodies directed to those polypeptides are useful to provide immunological probes 
for differential identification of the tissue(s) or cell type(s). For a number of disorders 
5 of the above tissues or cells, particularly of the immune system and colon, expression 
of this gene at significantly higher or lower levels may be detected in certain tissues 
or cell types (e.g., hematopoietic, immune, gastrointestinal, and cancerous and 
wounded tissues) or bodily fluids (e.g., lymph, serum, plasma, urine, synovial fluid or 
spinal fluid) taken from an individual having such a disorder, relative to the standard 
10 gene expression level, i.e., the expression level in healthy tissue from an individual 
not having the disorder. 

Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 
184 as residues: Leu- 16 to Ser-23, Ser-38 to Pro-43, Gly-53 to Leu-60. 

The tissue distribution in colon adenocarcinoma suggests that the protein 
15 product of this clone would be useful for the diagnosis and/or treatment of 

gastrointestinal diseases and/or disorders, particularly proliferative conditions. 
Expression of this gene product in fetal and proliferative cells and tissues suggests 
that it may be a marker cancers, and that it's misregulated expression may in fact 
contribute to the development or progression of the types of cancers dictated by its 
20 expression. 

Similarly, the expression of this gene product in fetal liver/spleen - a primary 
site of early hematopoiesis - taken together with its expression in peripheral blood 
leukocytes suggests that this gene product may play a role in a variety of 
hematopoietic processes, including the survival, proliferation, activation, and/or 

25 differentiation of all blood cell lineages, including the totipotent hematopoietic stem 
cell. Such a gene product may therefore play a role in a variety of hematopoietic 
disorders including inflammation; immune dysfunction; defects in immune 
surveillance; and hematopoietic cancers and lymphomas. Similarly, developmental 
tissues rely on decisions involving cell differentiation and/or apoptosis in pattern 

30 formation. Dysregulation of apoptosis can result in inappropriate suppression of cell 
death, as occurs in the development of some cancers, or in failure to control the extent 
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of cell death, as is believed to occur in acquired immunodeficiency and certain 
neurodegenerative disorders, such as spinal muscular atrophy (SMA). 

Therefore, the polynucleotides and polypeptides of the present invention are 
useful in treating, detecting, and/or preventing said disorders and conditions, in 
addition to other types of degenerative conditions. Thus this protein may modulate 
apoptosis or tissue differentiation and would be useful in the detection, treatment, 
and/or prevention of degenerative or proliferative conditions and diseases. Protein, as 
well as, antibodies directed against the protein may show utility as a tumor marker 
and/or immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO: 87 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence 
would be cumbersome. Accordingly, preferably excluded from the present invention 
are one or more polynucleotides comprising a nucleotide sequence described by the 
general formula of a-b, where a is any integer between 1 to 1701 of SEQ ID NO:87, b 
is an integer of 15 to 1715, where both a and b correspond to the positions of 
nucleotide residues shown in SEQ ID NO: 87, and where b is greater than or equal to a 
+ 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 78 

The gene encoding the disclosed cDNA is believed to reside on chromosome 
20. Accordingly, polynucleotides related to this invention are useful as a marker in 
linkage analysis for chromosome 20. 

It has been discovered that this gene is expressed primarily in brain. 

Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the tissue(s) or cell type(s) present in a biological sample and for 
diagnosis of the following diseases and conditions: neurodegenerative diseases and/or 
disorders. Similarly, polypeptides and antibodies directed to those polypeptides are 
useful to provide immunological probes for differential identification of the tissue(s) 
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or cell type(s). For a number of disorders of the above tissues or cells, particularly of 
the central nervous system, expression of this gene at significantly higher or lower 
levels may be detected in certain tissues or cell types (e.g., neural, and cancerous and 
wounded tissues) or bodily fluids (e.g., lymph, serum, plasma, urine, synovial fluid or 
5 spinal fluid) taken from an individual having such a disorder, relative to the standard 
gene expression level, i.e., the expression level in healthy tissue from an individual 
not having the disorder. This gene is believed to reside on chromosome 20, D20S 111- 
D20S195. Polynucleotides corresponding to this gene are useful, therefore, as 
chromosome markers. 

10 Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 

185 as residues: Met-1 to Tyr-6, Thr-38 to Ala-44. 

The tissue distribution in brain tissue indicates that the protein products of this 
clone are useful for diagnosis and treatment of disorders of the central nervous 
system. Moreover, the protein product of this clone is useful for the detection, 
15 treatment, and/or prevention of neurodegenerative disease states, behavioral 
disorders, or inflammatory conditions which include, but are not limited to 
Alzheimer's Disease, Parkinson's Disease, Huntington's Disease, Tourette Syndrome, 
meningitis, encephalitis, demyelinating diseases, peripheral neuropathies, neoplasia, 
trauma, congenital malformations, spinal cord injuries, ischemia and infarction, 
20 aneurysms, hemorrhages, schizophrenia, mania, dementia, paranoia, obsessive 
compulsive disorder, depression, panic disorder, learning disabilities, ALS, 
psychoses, autism, and altered behaviors, including disorders in feeding, sleep 
patterns, balance, and perception. 

In addition, elevated expression of this gene product in regions of the brain 
25 suggests it plays a role in normal neural function. Potentially, this gene product is 
involved in synapse formation, neurotransmission, learning, cognition, homeostasis, 
or neuronal differentiation or survival. Protein, as well as, antibodies directed against 
the protein may show utility as a tumor marker and/or immunotherapy targets for the 
above listed tissues. 

30 Many polynucleotide sequences, such as EST sequences, are publicly 

available and accessible through sequence databases. Some of these sequences are 
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related to SEQ ID NO:88 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence 
would be cumbersome. Accordingly, preferably excluded from the present invention 
5 are one or more polynucleotides comprising a nucleotide sequence described by the 
general formula of a-b, where a is any integer between 1 to 403 of SEQ ID NO:88, b 
is an integer of 15 to 417, where both a and b correspond to the positions of 
nucleotide residues shown in SEQ ID NO:88, and where b is greater than or equal to a 
+ 14. 

10 

FEATURES OF PROTEIN ENCODED BY GENE NO: 79 

When tested against U937 cell lines, supernatants removed from cells 
containing this gene activated the GAS (gamma activating sequence) promoter 
element. Thus, it is likely that this gene activates myeloid cells, and to a lesser extent, 

15 other immune and hematopoietic cells or cell types, through the JAK-STAT signal 
transduction pathway. GAS is a promoter element found upstream of many genes 
which are involved in the Jak-STAT pathway. The Jak-STAT pathway is a large, 
signal transduction pathway involved in the differentiation and proliferation of cells. 
Therefore, activation of the Jak-STAT pathway, reflected by the binding of the GAS 

20 element, can be used to indicate proteins involved in the proliferation and 
differentiation of cells. 

In specific embodiments, polypeptides of the invention comprise the following 
amino acid sequence: FQLYFNPELIFKHFQIWRLITNFLFFGPVGFNFLFNMIFLY 
RYCRMLEEGSFRGRTADFVFMFLFGGFLMTLFGLFVSLVFLGQAFTIMLVYV 

25 WSRXNPYVRMNFFGLLNFQAPFLPWVLMGFSLLLGNSIIVDLLGIAVGHIYFF 
LEDVFPNQPGGIRILKTPSILKAIFDTPDEDPNYNPLPEERPGGFAWGEGQ SEQ 
I D N O : 3 4 0 ) , 

GVGQATVGKMAYQSLRLEYLQIPPVSRAYTTACVLTTAAVQLELITPF 
QLYFNPELIFKHFQrWRl^ITNFLFFGPVGFNFLFNMIFLYRYCRMLEEGSFRGR 

30 TADFVF (SEQ ID NO:341), LIFKHFQIWRLITNFLFFGPVGF (SEQ ID NO:342), 
FLYRYCRMLEEGSFRGRTADFVFMF (SEQ ID NO:343), LVFLGQAFTIMLVYV 
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WSRXNPYV (SEQ ID NO:344), VLMGFSLLLGNSIIVDLLGIA (SEQ ID NO:345), 
NQPGGIRILKTPSILKAIFDTPDED (SEQ ID NO:346), RLEYLQIPPVSRAYTTAC 
VLTTAAVQLE (SEQ ID NO:347), and/or RLITNFLFFGPVGFNFLFNMIFLYRYC 
RMLE (SEQ ID NO:348). Polynucleotides encoding these polypeptides are also 
encompassed by the invention. The gene encoding the disclosed cDNA is believed to 
reside on chromosome 17. Accordingly, polynucleotides related to this invention are 
useful as a marker in linkage analysis for chromosome 17. 

It has been discovered that this gene is expressed primarily in smooth muscle, 
fetal brain, fetal liver and to a lesser extent in activated macrophage, colon cancer. 

Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the tissue(s) or cell type(s) present in a biological sample and for 
diagnosis of the following diseases and conditions: developmental diseases, immune- 
related diseases, neural disorders, and vascular diseases and conditions. Similarly, 
polypeptides and antibodies directed to those polypeptides are useful to provide 
immunological probes for differential identification of the tissue(s) or cell type(s). For 
a number of disorders of the above tissues or cells, particularly of the immune system 
and central nervous system, expression of this gene at significantly higher or lower 
levels may be detected in certain tissues or cell types (e.g., developmental, vascular, 
immune, and cancerous and wounded tissues) or bodily fluids (e.g., lymph, serum, 
plasma, amniotic fluid, urine, synovial fluid or spinal fluid) taken from an individual 
having such a disorder, relative to the standard gene expression level, i.e., the 
expression level in healthy tissue from an individual not having the disorder. 

The tissue distribution in fetal liver, macrophage, and fetal brain indicates that 
the protein products of this clone are useful for treating and diagosis of immune 
system-related diseases and CNS diseases. Moreover, the protein product of this clone 
is useful for the treatment and diagnosis of hematopoietic related disorders such as 
anemia, pancytopenia, leukopenia, thrombocytopenia or leukemia since stromal cells 
are important in the production of cells of hematopoietic lineages. The uses include 
bone marrow cell ex- vivo culture, bone marrow transplantation, bone marrow 
reconstitution, radiotherapy or chemotherapy of neoplasia. The gene product may also 
be involved in lymphopoiesis, therefore, it can be used in immune disorders such as 
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infection, inflammation, allergy, immunodeficiency etc. In addition, this gene product 
may have commercial utility in the expansion of stem cells and committed 
progenitors of various blood lineages, and in the differentiation and/or proliferation of 
various cell types. Alternatively, the protein is useful in the detection, treatment, 
5 and/or prevention of vascular conditions, which include, but are not limited to, 
microvascular disease, vascular leak syndrome, aneurysm, stroke, atherosclerosis, 
arteriosclerosis, or embolism. 

Moreover, the expression within fetal tissue and other cellular sources marked 
by proliferating cells, combined with the GAS biological activity, suggests this 
10 protein may play a role in the regulation of cellular division, and may show utility in 
the diagnosis and treatment of cancer and other proliferative disorders. Similarly, 
developmental tissues rely on decisions involving cell differentiation and/or apoptosis 
in pattern formation. Dysregulation of apoptosis can result in inappropriate 
suppression of cell death, as occurs in the development of some cancers, or in failure 
15 to control the extent of cell death, as is believed to occur in acquired 

immunodeficiency and certain neurodegenerative disorders, such as spinal muscular 
atrophy (SMA). Therefore, the polynucleotides and polypeptides of the present 
invention are useful in treating, detecting, and/or preventing said disorders and 
conditions, in addition to other types of degenerative conditions. Thus this protein 
20 may modulate apoptosis or tissue differentiation and would be useful in the detection, 
treatment, and/or prevention of degenerative or proliferative conditions and diseases. 
Protein, as well as, antibodies directed against the protein may show utility as a tumor 
marker and/or immunotherapy targets for the above listed tissues. Protein, as well as, 
antibodies directed against the protein may show utility as a tumor marker and/or 
25 immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO: 89 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
30 excluded from the scope of the present invention. To list every related sequence 

would be cumbersome. Accordingly, preferably excluded from the present invention 
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are one or more polynucleotides comprising a nucleotide sequence described by the 
general formula of a-b, where a is any integer between 1 to 1 153 of SEQ ID NO:89, b 
is an integer of 15 to 1 167, where both a and b correspond to the positions of 
nucleotide residues shown in SEQ ID NO:89, and where b is greater than or equal to a 
5 + 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 80 

The translation product of this gene shares sequence homology with 
proacrosin binding proteins (sp32) from non-human mammalian species. The binding 

10 of sp32 to proacrosin may be involved in packaging the acrosin zymogen into the 
acrosomal matrix. See, for example, J Biol Chem. 1994 Apr 1; 269(13): 10133- 
10140, incorporated herein by reference. Accordingly, the inventors have termed the 
translation product of this gene human sp32 or ,, h-sp32". Contact of cells with 
supernatant expressing the product of this gene has been shown to increase the 

15 permeability of the plasma membrane of PMN to calcium. Thus it is likely that the 

product of this gene is involved in a signal transduction pathway that is initiated when 
the product binds a receptor on the surface of the plasma membrane of both 
neutrophils, and to a lesser extent in other immune and hematopoietic cells. Thus, 
polynucleotides and polypeptides have uses which include, but are not limited to, 

20 activating 

In specific embodiments, polypeptides of the invention comprise the following 
amino acid sequence: HAS AGPDGSSPA (SEQ ID NO:349), 
ELLLEKPKPWQPPAAAPHRALLVLCYSIVENTCIITPTAKAWKYMEEEILGFG 
KSVCDSLGRRHMSTCALCDFCSLKLEQCHSEASLQRQQCDTSHKTPFAAPCL 
25 PPRACPSATR (SEQ ID NO:350), 

LPGWGFPTKICDTDYIQYPNYCSFKSQQCLMR 

NRNRKVSRMRCLQNETYSALSPGKSEDVVLRWSQEFSTLTLGQFG (SEQ ID 
NO:35 1), SPVLLPAFPPLPVPLLALPVSAPLPACVLVSAPACAPLLAPACAL 
ALAPGFPGTRRIVGALPRCC (SEQ ID NO:352), LLVLCYSIVENTCIITPTAK 
30 AWKYMEEEILGFGKS (SEQ ID NO:353), and/or LKLEQCHSEASLQRQQC 
DTSHKTPFA (SEQ ID NO:354). Polynucleotides encoding these polypeptides are 
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also encompassed by the invention. The gene encoding the disclosed cDNA is 
believed to reside on chromosome 12. Accordingly, polynucleotides related to this 
invention are useful as a marker in linkage analysis for chromosome 12. 

It has been discovered that this gene is expressed primarily in testis. 

Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the tissue(s) or cell type(s) present in a biological sample and for 
diagnosis of the following diseases and conditions: reproductive disorders. Similarly, 
polypeptides and antibodies directed to those polypeptides are useful to provide 
immunological probes for differential identification of the tissue(s) or cell type(s). For 
a number of disorders of the above tissues or cells, particularly of the reproductive 
diseases, expression of this gene at significantly higher or lower levels may be 
detected in certain tissues or cell types (e.g., reproductive, testis, prostate, epidiymus, 
and cancerous and wounded tissues) or bodily fluids (e.g., lymph, seminal fluid, 
serum, plasma, urine, synovial fluid or spinal fluid) taken from an individual having 
such a disorder, relative to the standard gene expression level, i.e., the expression 
level in healthy tissue from an individual not having the disorder. This gene is 
believed to map to chromosome 12 and is thought to be useful as a chromosome 
marker. 

Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 
187 as residues: Asp-27 to Ser-32, Pro-52 to Thr-58, Arg-63 to Asn-70, Gln-78 to 
Gly-83, Thr-107 to Asn-1 13, Thr-160 to Val-176, Ser-188 to Gly-241, Leu-248 to 
Pro-265, Tyr-302 to Gly-314. 

The tissue distribution in testis, combined with the specific homology to the 
sp32 protein indicates that the protein products of this clone are useful for the 
diagnosis, treating, and/or prevention of reproductive diseases and/or disorders. 
Moreover, polynucleotides and polypeptides corresponding to this gene are useful for 
the treatment and diagnosis of conditions concerning proper testicular function (e.g. 
endocrine function, sperm maturation), as well as cancer. Therefore, this gene product 
is useful in the treatment of male infertility and/or impotence. This gene product is 
also useful in assays designed to identify binding agents, as such agents (antagonists) 
are useful as male contraceptive agents. 
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Similarly, the protein is believed to be useful in the treatment and/or diagnosis 
of testicular cancer. The testes are also a site of active gene expression of transcripts 
that may be expressed, particularly at low levels, in other tissues of the body. 
Therefore, this gene product may be expressed in other specific tissues or organs 
5 where it may play related functional roles in other processes, such as hematopoiesis, 
inflammation, bone formation, and kidney function, to name a few possible target 
indications. The protein is useful in application and utility as a contraceptive, either 
directly or indirectly. Based upon the detected calcium flux activity, the protein may 
also be useful as an effect treatment for infertility (i.e. for inhibiting autoimmune 

10 disorders). Protein, as well as, antibodies directed against the protein may show utility 
as a tumor marker and/or immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:90 and may have been publicly available prior to conception of 

15 the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence 
would be cumbersome. Accordingly, preferably excluded from the present invention 
are one or more polynucleotides comprising a nucleotide sequence described by the 
general formula of a-b, where a is any integer between 1 to 1878 of SEQ ID NO:90, b 

20 is an integer of 15 to 1892, where both a and b correspond to the positions of 

nucleotide residues shown in SEQ ID NO:90, and where b is greater than or equal to a 
+ 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 81 

25 The translation product of this contig has consistent sequence homology with 

a number of previously described viral tat proteins (see, for example, Stevens, et al., J. 

Virol. 64:3716-3725 (1990), which is hereby incorporated by reference, herein). 

In specific embodiments, polypeptides of the invention comprise the following 

amino acid sequence: QVSGLILSLSCGMDGLALDGSPSPSPXTEKAGRCISQTSL 
30 (SEQ ID NO:355), QVSGLILSLSCGMDGLALDGSPSPSPXTEKAGRCISQTSLP 

GKWEV (SEQ ID NO:356), RASKTVPRMPPNWPAKMPCLCHIRTVEHLGTIS 
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SGAPGRPTGQQAARTYHICWIHPGQKIDSLPPSSQHPRSQQLAPGTWPSTSTT 
KPAEETLGSSASLPISQARKSEKCTFQPSPWXVRGKESHQVPAHPSHRTETES 
D HSPVRKPPSRGTRTGDFTVGDWSEAWLLELALL (SEQ ID NO:357), RMPPN 
WPAKMPCLCHIRTVEHLG (SEQ ID NO:358), GRPTGQQAARTYHICWIHPG 

5 QKIDS (SEQ ID NO:359), WPSTSTTKPAEETLGSSASLPISQA (SEQ ID NO:360), 
KSEKCTFQPSPWXVRGKESHQVP (SEQ ID NO:361), and/or KPPSRGTRTGDF 
TVGDWSEAWLLE (SEQ ID NO:362). Polynucleotides encoding these polypeptides 
are also encompassed by the invention. 

It has been discovered that this gene is expressed almost exclusively in 

10 neutrophils. 

Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the tissue(s) or cell type(s) present in a biological sample and for 
diagnosis of immune disorders. Similarly, polypeptides and antibodies directed to 
those polypeptides are useful to provide immunological probes for differential 

15 identification of the tissue(s) or cell type(s). For a number of disorders of the immune 
system, expression of this gene at significantly higher or lower levels may be detected 
in certain tissues or cell types (e.g., immune, hematopoietic, and cancerous and 
wounded tissues) or bodily fluids (e.g., lymph, serum, plasma, urine, synovial fluid or 
spinal fluid) taken from an individual having such a disorder, relative to the standard 

20 gene expression level, i.e., the expression level in healthy tissue from an individual 

not having the disorder. In addition, molecules of the present invention can be used to 
regulate transcription and translation of genes in cells of the immune system, as well 
as in other cell types. Such transcriptional and translation regulation is useful for 
diagnosing and treating a number of disorders in which an alterred state of 

25 transcription and translation may be a factor in the disorder. Such disorders include 
many viral infections, particularly of immune cells, including HIV-1, HIV-2, human 
T-cell lymphotropic virus (HTLV)-I, and HTLV-II, as well as other DNA and RNA 
viruses such as herpes simplex virus (HSV)-l, HSV-2, HSV-6, cytomegalovirus 
(CMV), Epstein-Barr virus (EBV), herpes samirii, adenoviruses, rhinoviruses, 

30 influenza viruses, reoviruses, and the like. In addition, the ability to use molecules of 
the present invention to molecularly regulate the processes of transcription and 
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translation is useful in the diagnosis and treatment of many types of cancers, 
particularly those of the immune system, including ovarian cancer, breast cancer, 
colon cancer, cardiac tumors, pancreatic cancer, melanoma, retinoblastoma, 
glioblastoma, lung cancer, intestinal cancer, testicular cancer, stomach cancer, 
5 neuroblastoma, myxoma, myoma, lymphoma, endothelioma, osteoblastoma, 
osteoclastoma, osteosarcoma, chondrosarcoma, adenoma, and the like. 

Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 
188 as residues: Gln-2 to Trp-12, Ala-30 to Glu-35, Gln-42 to Ser-51. 

The tissue distribution in neutrophils, combined with the homology to viral tat 

10 proteins suggests that the protein product of this clone is useful for the diagnosis and 
treatment of immune disorders, particularly viral infections and proliferative 
disorders. Further, since this clone has a high degree of sequence relatedness to 
factors which are involved in the regulation of transcription and translation, this clone 
is useful as a regulator of such processes. Protein, as well as, antibodies directed 

15 against the protein may show utility as a tumor marker and/or immunotherapy targets 
for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:91 and may have been publicly available prior to conception of 

20 the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence 
would be cumbersome. Accordingly, preferably excluded from the present invention 
are one or more polynucleotides comprising a nucleotide sequence described by the 
general formula of a-b, where a is any integer between 1 to 509 of SEQ ID NO:91, b 

25 is an integer of 15 to 523, where both a and b correspond to the positions of 

nucleotide residues shown in SEQ ID NO:91, and where b is greater than or equal to a 
+ 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 82 

30 The translation product of this contig has clear sequence identity with a 

number of thioredoxins and endoplasmic reticulum resident proteins (see, for 
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example, Shorrosh and Dixon, Plant J. 2:51-58 (1992), which is hereby incorporated 
by reference, herein). 

In specific embodiments, polypeptides of the invention comprise the following 
amino acid sequence: PCADCLSAWA (SEQ ID NO: 363). Polynucleotides encoding 
5 these polypeptides are also encompassed by the invention.The gene encoding the 
disclosed cDNA is believed to reside on chromosome 5. Accordingly, 
polynucleotides related to this invention are useful as a marker in linkage analysis for 
chromosome 5. 

It has been discovered that this gene is expressed primarily in adipocytes and 
10 striatum depression, and in lower abundance in prostate, whole brain, fetal liver, and 
spleen. 

Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the tissue(s) or cell type(s) present in a biological sample and for 
diagnosis of the following diseases and conditions: Prostate cancer, CNS diseases, 

15 immune disorders . Similarly, polypeptides and antibodies directed to those 

polypeptides are useful to provide immunological probes for differential identification 
of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells, 
particularly of the immune, expression of this gene at significantly higher or lower 
levels may be detected in certain tissues or cell types (e.g., neural, hematopoietic, 

20 immune, and cancerous and wounded tissues) or bodily fluids (e.g., seminal fluid, 
amniotic fluid, serum, plasma, urine, synovial fluid or spinal fluid) taken from an 
individual having such a disorder, relative to the standard gene expression level, i.e., 
the expression level in healthy tissue from an individual not having the disorder. 
Since the translation product of this clone has a high degree of sequence relatedness 

25 to many thioredoxins, it can be used as a food additive to improve flour quality or to 
suppress the anti-nutritional effects of leguminous plants. Molecules of the present 
invention can further used to inactivate toxins, for example, bee or snake venom. 

Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 
189 as residues: Trp-43 to Ala-49, Pro-68 to Ala-74, Glu-100 to Gly-1 11, Glu-120 to 

30 Asn-125, Pro-141 to Ala-154, Asp-157 to Lys-171, Cys-177 to Ile-182, Ser-248 to 
Leu-253, Thr-280 to Glu-285, Gly-353 to Val-359. 
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The tissue distribution in whole brain suggests that the protein product of this 
clone would be useful for the detection, treatment, and/or prevention of 
neurodegenerative disease states, behavioral disorders, or inflammatory conditions 
which include, but are not limited to Alzheimer's Disease, Parkinson's Disease, 
5 Huntington's Disease, Tourette Syndrome, meningitis, encephalitis, demyelinating 
diseases, peripheral neuropathies, neoplasia, trauma, congenital malformations, spinal 
cord injuries, ischemia and infarction, aneurysms, hemorrhages, schizophrenia, 
mania, dementia, paranoia, obsessive compulsive disorder, depression, panic disorder, 
learning disabilities, ALS, psychoses, autism, and altered behaviors, including 
10 disorders in feeding, sleep patterns, balance, and perception. In addition, elevated 

Expression of this gene product in regions of the brain suggests it plays a role 
in normal neural function. Potentially, this gene product is involved in synapse 
formation, neurotransmission, learning, cognition, homeostasis, or neuronal 
differentiation or survival. The secreted protein can also be used to determine 
15 biological activity, to raise antibodies, as tissue markers, to isolate cognate ligands or 
receptors, to identify agents that modulate their interactions, and as nutritional 
supplements. It may also have a very wide range of biological activities. Typical of 
these are cytokine, cell proliferation/differentiation modulating activity or induction 
of other cytokines; immunostimulating/immunosuppressant activities (e.g. for treating 
20 human immunodeficiency virus infection, cancer, autoimmune diseases and allergy); 
regulation of hematopoiesis (e.g. for treating anemia or as adjunct to chemotherapy); 
stimulation or growth of bone, cartilage, tendons, ligaments and/or nerves (e.g. for 
treating wounds, stimulation of follicle stimulating hormone (for control of fertility); 
chemotactic and chemokinetic activities (e.g. for treating infections, tumors); 
25 hemostatic or thrombolytic activity (e.g. for treating hemophilia, cardiac infarction 
etc.); anti-inflammatory activity (e.g. for treating septic shock, Crohn's disease); as 
antimicrobials; for treating psoriasis or other hyperproliferative diseases; for 
regulation of metabolism, and behavior. Also contemplated is the use of the 
corresponding nucleic acid in gene therapy procedures. Protein, as well as, antibodies 
30 directed against the protein may show utility as a tumor marker and/or 
immunotherapy targets for the above listed tissues. 
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Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:92 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
5 excluded from the scope of the present invention. To list every related sequence 

would be cumbersome. Accordingly, preferably excluded from the present invention 
are one or more polynucleotides comprising a nucleotide sequence described by the 
general formula of a-b, where a is any integer between 1 to 1368 of SEQ ID NO:92, b 
is an integer of 15 to 1382, where both a and b correspond to the positions of 
10 nucleotide residues shown in SEQ ID NO:92, and where b is greater than or equal to a 
+ 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 83 

When tested against TF-1 cell lines, supernatants removed from cells 
15 containing this gene activated the ISRE (interferon-sensitive responsive element ) 
promoter element. Thus, it is likely that this gene activates myeloid cells, and to a 
lesser extent, in immune and hematopoietic cells or tissues, through the JAK-STAT 
signal transduction pathway. ISRE is a promoter element found upstream in many 
genes which are involved in the Jak-STAT pathway. The Jak-STAT pathway is a 
20 large, signal transduction pathway involved in the differentiation and proliferation of 
cells. Therefore, activation of the Jak-STAT pathway, reflected by the binding of the 
ISRE element, can be used to indicate proteins involved in the proliferation and 
differentiation of cells. 

In specific embodiments, polypeptides of the invention comprise the following 
25 amino acid sequence: HAS G YLCI VLL (SEQ ID NO:364). Polynucleotides encoding 
these polypeptides are also encompassed by the invention. 

It has been discovered that this gene is expressed exclusively in Rejected 
Kidney. 

Therefore, nucleic acids of the invention are useful as reagents for differential 
30 identification of the tissue(s) or cell type(s) present in a biological sample and for 
diagnosis of the following diseases and conditions: kidney and other urinary tract 
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disorders and disorders related to, or resulting from, transplantation. Similarly, 
polypeptides and antibodies directed to those polypeptides are useful to provide 
immunological probes for differential identification of the tissue(s) or cell type(s). For 
a number of disorders of the above tissues or cells, particularly of the immune and 
5 renal systems, expression of this gene at significantly higher or lower levels may be 
detected in certain tissues or cell types (e.g., renal, kidney, urogenital, immune, 
hematopoietic, and cancerous and wounded tissues) or bodily fluids (e.g., lymph, 
* serum, plasma, urine, synovial fluid or spinal fluid) taken from an individual having 
such a disorder, relative to the standard gene expression level, i.e., the expression 

10 level in healthy tissue from an individual not having the disorder. Molecules of the 
present invention are particularly useful in the diagnosis and treatment of disorders 
related to transplantation, particularly kidney transplantation. 

Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 
190 as residues: Asn-49 to Gln-54, Glu-150 to Asp- 159. 

15 The tissue distribution in rejected kidney tissue suggests that the protein 

product of this clone would be useful for diagnosis and treatment of disorders related 
to or resulting from rejection of transplanted organs, particularly the kidney. 
Moreover, the protein product of this clone could be used in the treatment and/or 
detection of kidney diseases including renal failure, nephritus, renal tubular acidosis, 

20 proteinuria, pyuria, edema, pyelonephritis, hydronephritis, nephrotic syndrome, crush 
syndrome, glomerulonephritis, hematuria, renal colic and kidney stones, in addition to 
Wilm's Tumor Disease, and congenital kidney abnormalities such as horseshoe 
kidney, polycystic kidney, and Falconi's syndrome. Considering the tissue distribution 
and detected ISRE biological activity, the protein is useful in modulating the immune 

25 response to aberrant kidney proteins, including autoantigens and aberrant proteins 
which are often present in degenerative and proliferative conditions. Protein, as well 
as, antibodies directed against the protein may show utility as a tumor marker and/or 
immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 

30 available and accessible through sequence databases. Some of these sequences are 

related to SEQ ID NO: 93 and may have been publicly available prior to conception of 
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the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence 
would be cumbersome. Accordingly, preferably excluded from the present invention 
are one or more polynucleotides comprising a nucleotide sequence described by the 
general formula of a-b, where a is any integer between 1 to 1733 of SEQ ID NO:93, b 
is an integer of 15 to 1747, where both a and b correspond to the positions of 
nucleotide residues shown in SEQ ID NO:93, and where b is greater than or equal to a 
+ 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 84 

The translation product of this gene shares sequence homology with the 
conserved MAL and plasmolipin protein (Magyar, et al, Gene 189:269-275 (1997); 
See Genbank Accession No.gnllPIDIe 183885), which are thought to be important in 
modulating T cell function, and proper CNS function, respectively. When tested 
against Jurkat cell lines, supernatants removed from cells containing this gene 
activated the GAS (gamma activating sequence) promoter element. Thus, it is likely 
that this gene activates myeloid cells, and to a lesser extent, immune or hematopoietic 
cells and tissues, through the JAK-STAT signal transduction pathway. GAS is a 
promoter element found upstream of many genes which are involved in the Jak-STAT 
pathway. The Jak-STAT pathway is a large, signal transduction pathway involved in 
the differentiation and proliferation of cells. Therefore, activation of the Jak-STAT 
pathway, reflected by the binding of the GAS element, can be used to indicate 
proteins involved in the proliferation and differentiation of cells. 

In specific embodiments, polypeptides of the invention comprise the following 
amino acid sequence: NSARAARAEIVLGLLVWTLIAGTEYFRVPAFGWV (SEQ 
ID NO:365). Polynucleotides encoding these polypeptides are also encompassed by 
the invention. 

It has been discovered that this gene is expressed primarily in T cells. 

Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the tissue(s) or cell type(s) present in a biological sample and for 
diagnosis of immune, hematopoietic, and neural diseases and/or disorders. Similarly, 
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polypeptides and antibodies directed to those polypeptides are useful to provide 
immunological probes for differential identification of the tissue(s) or cell type(s). For 
a number of disorders of the above tissues or cells, particularly of the immune system, 
expression of this gene at significantly higher or lower levels may be detected in 
5 certain tissues or cell types (e.g., immune, hematopoietic, and cancerous and wounded 
tissues) or bodily fluids (e.g., lymph, serum, plasma, urine, synovial fluid or spinal 
fluid) taken from an individual having such a disorder, relative to the standard gene 
expression level, i.e., the expression level in healthy tissue from an individual not 
having the disorder. Nucleic acids of the present invention are useful as probes for 

10 detecting traumatic and pathological changes in the central and peripheral nervous 

systems. Molecules of the present invention may be involved in regulating the growth 
of Schwann cells and other neural cells. Molecules of the present invention are also 
useful as modulators of the interaction between Schwann cells and other neural cells 
and the extracellular matrix and is therefore useful for the therapeutic intervention in 

1 5 nerve damage primarily by facilitating regeneration of damaged axons and 
regenerating nerve cells in damaged nervous system tissues. 

Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 
191 as residues: Ser-58 to His-64. 

The tissue distribution in T-cells, combined with the homology to the MAL 

20 and plasmolipin proteins and the detected GAS biological activity suggests that the 
protein product of this clone would be useful for the diagnosis and treatment of 
immune disorders including, but not limited to, AIDS and other immunodeficiencies. 
Morever, the expression of this gene product suggests a role in regulating the 
proliferation; survival; differentiation; and/or activation of hematopoietic cell 

25 lineages, including blood stem cells. This gene product may be involved in the 

regulation of cytokine production, antigen presentation, or other processes suggesting 
a usefulness in the treatment of cancer (e.g. by boosting immune responses). 

Since the gene is expressed in cells of lymphoid origin, the natural gene 
product may be involved in immune functions. Therefore it may be also used as an 

30 agent for immunological disorders including arthritis, asthma, leukemia, rheumatoid 
arthritis, granulomatous disease, inflammatory bowel disease, sepsis, acne, 
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neutropenia, neutrophilia, psoriasis, hypersensitivities, such as T-cell mediated 
cytotoxicity; immune reactions to transplanted organs and tissues, such as host- 
versus-graft and graft-versus-host diseases, or autoimmunity disorders, such as 
autoimmune infertility, lense tissue injury, demyelination, systemic lupus 
erythematosis, drug induced hemolytic anemia, rheumatoid arthritis, Sjogren's 
disease, scleroderma and tissues. Moreover, the protein may represent a secreted 
factor that influences the differentiation or behavior of other blood cells, or that 
recruits hematopoietic cells to sites of injury. In addition, this gene product may have 
commercial utility in the expansion of stem cells and committed progenitors of 
various blood lineages, and in the differentiation and/or proliferation of various cell 
types. 

The secreted protein can also be used to determine biological activity, to raise 
antibodies, as tissue markers, to isolate cognate ligands or receptors, to identify agents 
that modulate their interactions, and as nutritional supplements. It may also have a 
very wide range of biological activities. Typical of these are cytokine, cell 
proliferation/differentiation modulating activity or induction of other cytokines; 
immunostimulating/immunosuppressant activities (e.g. for treating human 
immunodeficiency virus infection, cancer, autoimmune diseases and allergy); 
regulation of hematopoiesis (e.g. for treating anemia or as adjunct to chemotherapy); 
stimulation or growth of bone, cartilage, tendons, ligaments and/or nerves (e.g. for 
treating wounds, stimulation of follicle stimulating hormone (for control of fertility); 
chemotactic and chemokinetic activities (e.g. for treating infections, tumors); 
hemostatic or thrombolytic activity (e.g. for treating hemophilia, cardiac infarction 
etc.); anti-inflammatory activity (e.g. for treating septic shock, Crohn's disease); as 
antimicrobials; for treating psoriasis or other hyperproliferative diseases; for 
regulation of metabolism, and behavior. Also contemplated is the use of the 
corresponding nucleic acid in gene therapy procedures. Protein, as well as, antibodies 
directed against the protein may show utility as a tumor marker and/or 
immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
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related to SEQ ID NO:94 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence 
would be cumbersome. Accordingly, preferably excluded from the present invention 
5 are one or more polynucleotides comprising a nucleotide sequence described by the 
general formula of a-b, where a is any integer between 1 to 586 of SEQ ID NO:94, b 
is an integer of 15 to 600, where both a and b correspond to the positions of 
nucleotide residues shown in SEQ ID NO:94, and where b is greater than or equal to a 
+ 14. 

10 

FEATURES OF PROTEIN ENCODED BY GENE NO: 85 

The translation product of this clone has sequence identity to a protein 
tyrosine kinase reported by Oates and Wilks (The Worm Breeders Gazette 14:87-87 
(1995), which is hereby incorporated by reference herein). The gene encoding the 
15 disclosed cDNA is believed to reside on chromosome 2. Accordingly, 

polynucleotides related to this invention are useful as a marker in linkage analysis for 
chromosome 2. 

It has been discovered that this gene is expressed primarily in cerebellum, 
adult brain, retina, spinal cord, and kidney cortex. 

20 Therefore, nucleic acids of the invention are useful as reagents for differential 

identification of the tissue(s) or cell type(s) present in a biological sample and for 
diagnosis of the following diseases and conditions: neural, visual, and renal diseases 
and/or disorders. Similarly, polypeptides and antibodies directed to those 
polypeptides are useful to provide immunological probes for differential identification 

25 of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells, 
particularly of the CNS, retina, and kidney cortex. Expression of this gene at 
significantly higher or lower levels may be detected in certain tissues or cell types 
(e.g., neural, visual, renal, and cancerous and wounded tissues) or bodily fluids (e.g., 
lymph, serum, plasma, urine, synovial fluid or spinal fluid) taken from an individual 

30 having such a disorder, relative to the standard gene expression level, i.e., the 
expression level in healthy tissue from an individual not having the disorder. 
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The tissue distribution in cerebellum, adult brain, and spinal cord tissue 
suggests that the protein product of this clone would be useful for the diagnosis and 
treatment of neural diseases and disorders. The protein product of this clone is useful 
for the detection, treatment, and/or prevention of neurodegenerative disease states, 
behavioral disorders, or inflammatory conditions which include, but are not limited to 
Alzheimer's Disease, Parkinson's Disease, Huntington's Disease, Tourette Syndrome, 
meningitis, encephalitis, demyelinating diseases, peripheral neuropathies, neoplasia, 
trauma, congenital malformations, spinal cord injuries, ischemia and infarction, 
aneurysms, hemorrhages, schizophrenia, mania, dementia, paranoia, obsessive 
compulsive disorder, depression, panic disorder, learning disabilities, ALS, 
psychoses, autism, and altered behaviors, including disorders in feeding, sleep 
patterns, balance, and perception. In addition, elevated 

Expression of this gene product in regions of the brain suggests it plays a role 
in normal neural function. Potentially, this gene product is involved in synapse 
formation, neurotransmission, learning, cognition, homeostasis, or neuronal 
differentiation or survival. Moreover, the protein product of this clone could be used 
in the treatment and/or detection of kidney diseases including renal failure, nephritus, 
renal tubular acidosis, proteinuria, pyuria, edema, pyelonephritis, hydronephritis, 
nephrotic syndrome, crush syndrome, glomerulonephritis, hematuria, renal colic and 
kidney stones, in addition to Wilm's Tumor Disease, and congenital kidney 
abnormalities such as horseshoe kidney, polycystic kidney, and Falconi's syndrome. 
Protein, as well as, antibodies directed against the protein may show utility as a tumor 
marker and/or immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:95 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence 
would be cumbersome. Accordingly, preferably excluded from the present invention 
are one or more polynucleotides comprising a nucleotide sequence described by the 
general formula of a-b, where a is any integer between 1 to 572 of SEQ ID NO:95, b 
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is an integer of 15 to 586, where both a and b correspond to the positions of 
nucleotide residues shown in SEQ ID NO:95, and where b is greater than or equal to a 
+ 14. 

5 FEATURES OF PROTEIN ENCODED BY GENE NO: 86 

The translation product of this clone has homology to trkB, and it is thought 
that the protein of the present invention is a novel novel neural receptor protein- 
tyrosine kinase, a trkB homolog (See for example, ). This protein is likely to be 
derived from a gene for a ligand-regulated receptor closely related to the human trk 
10 oncogene. Northern (RNA) analysis showed that the trkB gene is expressed 

predominantly in the brain and that trkB expresses multiple mRNAs, ranging from 0.7 
to 9 kb. Hybridization of cerebral mRNAs with a variety of probes indicates that there 
are mRNAs encoding truncated trkB receptors. 

In specific embodiments, polypeptides of the invention comprise the sequence 
1 5 PCSPPDSPPLPGAFVWRVLWVC (SEQ ID NO:366). Polynucleotides encoding this 
polypeptide are also encompassed by the invention. 

It has been discovered that this gene is expressed primarily in breast cancer, 
colon tumor, and B-cell lymphoma. 

Therefore, nucleic acids of the invention are useful as reagents for differential 
20 identification of the tissue(s) or cell type(s) present in a biological sample and for 

diagnosis of the following diseases and conditions: breast cancer, colon tumor, B-cell 
lymphoma. Similarly, polypeptides and antibodies directed to those polypeptides are 
useful to provide immunological probes for differential identification of the tissue(s) 
or cell type(s). For a number of disorders of the above tissues or cells, particularly of 
25 the immune, expression of this gene at significantly higher or lower levels may be 
detected in certain tissues or cell types (e.g., neural, gastrointestinal, immune, and 
cancerous and wounded tissues) or bodily fluids (e.g., lymph, serum, plasma, urine, 
synovial fluid or spinal fluid) taken from an individual having such a disorder, 
relative to the standard gene expression level, i.e., the expression level in healthy 
30 tissue from an individual not having the disorder. 
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Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 
193 as residues: Ser-29 to Asn-40. 

The tissue distribution in proliferative cells and tissues suggests that the 
protein product of this clone would be useful for the treatment, detection, and/or 
5 prevention of cancer, particularly in the indicated tissues. The expression within 

cellular sources marked by proliferating cells suggests this protein may play a role in 
the regulation of cellular division, and may show utility in the diagnosis and treatment 
of cancer and other proliferative disorders. Similarly, developmental tissues rely on 
decisions involving cell differentiation and/or apoptosis in pattern formation. 
10 Dysregulation of apoptosis can result in inappropriate suppression of cell death, as 
occurs in the development of some cancers, or in failure to control the extent of cell 
death, as is believed to occur in acquired immunodeficiency and certain 
neurodegenerative disorders, such as spinal muscular atrophy (SMA). Therefore, the 
polynucleotides and polypeptides of the present invention are useful in treating, 
15 detecting, and/or preventing said disorders and conditions, in addition to other types 
of degenerative conditions. Thus this protein may modulate apoptosis or tissue 
differentiation and would be useful in the detection, treatment, and/or prevention of 
degenerative or proliferative conditions and diseases. 

Alternatively, the homology to the trkB protein suggests the protein product of 
20 this clone is useful for the detection, treatment, and/or prevention of 

neurodegenerative disease states, behavioral disorders, or inflammatory conditions 
which include, but are not limited to Alzheimer's Disease, Parkinson's Disease, 
Huntington's Disease, Tourette Syndrome, meningitis, encephalitis, demyelinating 
diseases, peripheral neuropathies, neoplasia, trauma, congenital malformations, spinal 
25 cord injuries, ischemia and infarction, aneurysms, hemorrhages, schizophrenia, 

mania, dementia, paranoia, obsessive compulsive disorder, depression, panic disorder, 
learning disabilities, ALS, psychoses, autism, and altered behaviors, including 
disorders in feeding, sleep patterns, balance, and perception. In addition, elevated 
expression of this gene product in regions of the brain suggests it plays a role in 
30 normal neural function. Potentially, this gene product is involved in synapse 
formation, neurotransmission, learning, cognition, homeostasis, or neuronal 
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differentiation or survival. Protein, as well as, antibodies directed against the protein 
may show utility as a tumor marker and/or immunotherapy targets for the above listed 
tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
5 available and accessible through sequence databases. Some of these sequences are 

related to SEQ ID NO: 96 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence 
would be cumbersome. Accordingly, preferably excluded from the present invention 
10 are one or more polynucleotides comprising a nucleotide sequence described by the 
general formula of a-b, where a is any integer between 1 to 788 of SEQ ID NO:96, b 
is an integer of 15 to 802, where both a and b correspond to the positions of 
nucleotide residues shown in SEQ ID NO:96, and where b is greater than or equal to a 
+ 14. 

15 

FEATURES OF PROTEIN ENCODED BY GENE NO: 87 

In specific embodiments, polypeptides of the invention comprise the following 
amino acid sequence: ARACFAYNGVCSEGRCWDSHFHGSV (SEQ ID NO:367), 
MSNMGKIPSLSLHIPINKYICSRIPK^IQKVNKSTVLQICLKRQIILNKNKM 
20 SKIGKANLVQIDIHSLGIVETGCVPSKRYCTLLTEQSGFPFLSHP (SEQ ID 
NO:368), 

MAGCCLKLFGVLSLCFLCGLISIERVICNPVSADFQVSTFCQRHCLLR 
SKVMFXIKGXTATIEVINENCTLVAAPPIGFPIXFL (SEQ ID NO:369), MSDHS 
KIGKANLVQIDIHSLGIVETGCVPSKRYCTLLTEQSGFPFLSHP (SEQ ID 
25 NO:370), MAGCCLKLFGVLSLCFLCGLISIERVICNPVSADFQVSTFCQRHCL 
LRSK (SEQ ID NO:371), VMFXIKGXTATIEVINENCTLVAAPPIGFPIXFL (SEQ 
ID NO:372). Polynucleotides encoding these polypeptides are also encompassed by 
the invention. 

It has been discovered that this gene is expressed primarily in dendritic cells, 
30 and smooth muscle. 
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Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the tissue(s) or cell type(s) present in a biological sample and for 
diagnosis of the following diseases and conditions: immune, hematopoietic, and 
vascular diseases and/or disorders. Similarly, polypeptides and antibodies directed to 
5 those polypeptides are useful to provide immunological probes for differential 

identification of the tissue(s) or cell type(s). For a number of disorders of the above 
tissues or cells, particularly of the immune, expression of this gene at significantly 
higher or lower levels may be detected in certain tissues (e.g., immune, 
hematopoietic, smooth muscle vascular, and cancerous and wounded tissues) or 
10 bodily fluids (e.g., lymph, serum, plasma, urine, synovial fluid or spinal fluid) taken 
from an individual having such a disorder, relative to the standard gene expression 
level, i.e., the expression level in healthy tissue from an individual not having the 
disorder. 

Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 
15 194 as residues: Asp-40 to Ser-52. 

The tissue distribution in dendritic cells suggests that the protein product of 
this clone would be useful for immune disorders. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
20 related to SEQ ID NO:97 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence 
would be cumbersome. Accordingly, preferably excluded from the present invention 
are one or more polynucleotides comprising a nucleotide sequence described by the 
25 general formula of a-b, where a is any integer between 1 to 1212 of SEQ ID NO:97, b 
is an integer of 15 to 1226, where both a and b correspond to the positions of 
nucleotide residues shown in SEQ ID NO:97, and where b is greater than or equal to a 
+ 14. 



30 FEATURES OF PROTEIN ENCODED BY GENE NO: 88 
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The translation product of this gene shares sequence homology with androgen- 
dependant expressed protein from golden hamster hair follicles which is thought to be 
important in regulating the secretions from glands in the skin (See GenBank 
Accession No. gil 1 9 1 3 1 5). 
5 In specific embodiments, polypeptides of the invention comprise the following 

amino acid sequence: PTEGRQK VLKTFT VPRS AL AMTKTSTCI YHFL VLS W YTF 
LNYYISQEGKDEVKPKILANGARWKY (SEQ ID NO:373), PTEGRQKVLKTF 
TVPRSALAMTKT (SEQ ID NO:375), PRSALAMTKTSTCIYHFLVLSWYTFLN 
YYISQEGK (SEQ ID NO:374), and/or FLNYYISQEGKDEVKPKILANGARWKY 
10 (SEQ ID NO:376). Polynucleotides encoding these polypeptides are also 
encompassed by the invention. 

It has been discovered that this gene is expressed primarily in lung, colon 
cancer, and testis. 

Therefore, nucleic acids of the invention are useful as reagents for differential 

15 identification of the tissue(s) or cell type(s) present in a biological sample and for 
diagnosis of the following diseases and conditions: disorders of secretory cells 
including cells in the lung, colon, testis and the skin. Similarly, polypeptides and 
antibodies directed to those polypeptides are useful to provide immunological probes 
for differential identification of the tissue(s) or cell type(s). For a number of disorders 

20 of the above tissues or cells, particularly of the secretory epithelial cells in the lung, 

intestine, testis and skin, expression of this gene at significantly higher or lower levels 
may be detected in certain tissues (e.g., cancerous and wounded tissues) or bodily 
fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) taken from an 
individual having such a disorder, relative to the standard gene expression level, i.e., 

25 the expression level in healthy tissue from an individual not having the disorder. 

Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 
195 as residues: Val-21 to Asp-30, Pro-101 to Thr-109. 

The tissue distribution and homology to androgen regulated protein suggests 
that the protein product of this clone would be useful for treating disorders that 

30 involve highly secretory cells including those in the colon, testis, and skin. It may be 
useful for diagnosing disorders such as colon, lung, or testicular cancer and may be 
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used to treat pulmonary conditions in patients with compromised respiratory function. 
In addition, the polynucleotides and polypeptides corresponding to this gene are 
useful for the treatment and diagnosis of conditions concerning proper testicular 
function (e.g. endocrine function, sperm maturation), as well as cancer. Therefore, 
this gene product is useful in the treatment of male infertility and/or impotence. This 
gene product is also useful in assays designed to identify binding agents, as such 
agents (antagonists) are useful as male contraceptive agents. 

Similarly, the protein is believed to be useful in the treatment and/or diagnosis 
of testicular cancer. The testes are also a site of active gene expression of transcripts 
that may be expressed, particularly at low levels, in other tissues of the body. 
Therefore, this gene product may be expressed in other specific tissues or organs 
where it may play related functional roles in other processes, such as hematopoiesis, 
inflammation, bone formation, and kidney function, to name a few possible target 
indications. Protein, as well as, antibodies directed against the protein may show 
utility as a tumor marker and/or immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO:98 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence 
would be cumbersome. Accordingly, preferably excluded from the present invention 
are one or more polynucleotides comprising a nucleotide sequence described by the 
general formula of a-b, where a is any integer between 1 to 1 106 of SEQ ID NO:98, b 
is an integer of 15 to 1 120, where both a and b correspond to the positions of 
nucleotide residues shown in SEQ ID NO:98, and where b is greater than or equal to a 
+ 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 89 

The translation product of this gene shares sequence homology with dec-205 a 
transmembrane protein which is thought to be important in antigen presentation in 
dendritic cells and T-cells. 
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It has been discovered that this gene is expressed primarily in macrophage, 
dendritic cells, lung and ulcerative colitis. 

Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the tissue(s) or cell type(s) present in a biological sample and for 
5 diagnosis of the following diseases and conditions: inflammatory diseases such as 
ulcerative colitis. Similarly, polypeptides and antibodies directed to those 
polypeptides are useful to provide immunological probes for differential identification 
of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells, 
particularly of the immune system, expression of this gene at significantly higher or 
10 lower levels may be detected in certain tissues (e.g., cancerous and wounded tissues) 
or bodily fluids (e.g., lymph, serum, plasma, urine, synovial fluid or spinal fluid) 
taken from an individual having such a disorder, relative to the standard gene 
expression level, i.e., the expression level in healthy tissue from an individual not 
having the disorder. 

15 Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 

196 as residues: Asp-30 to Arg-36, Gln-59 to Val-65. 

The distribution in macrophage, dendritic cells, lung and ulcerative colitis 
tissues, and homology to antigen presenting receptors suggests that the protein 
product of this clone would be useful for modulating the immune response in both 

20 acute and chronic inflammatory conditions. Protein, as well as, antibodies directed 

against the protein may show utility as a tumor marker and/or immunotherapy targets 
for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 

25 related to SEQ ID NO:99 and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence 
would be cumbersome. Accordingly, preferably excluded from the present invention 
are one or more polynucleotides comprising a nucleotide sequence described by the 

30 general formula of a-b, where a is any integer between 1 to 2582 of SEQ ID NO:99, b 
is an integer of 15 to 2596, where both a and b correspond to the positions of 
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nucleotide residues shown in SEQ ID NO:99, and where b is greater than or equal to a 
+ 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 90 

5 This gene maps to chromosome 22 and therefore polynucleotides of the 

present invention can be used in linkage analysis as a marker for chromosome 22. 

In specific embodiments, polypeptides of the invention comprise the sequence 
FKDQLVYPLLAFT (SEQ ID NO: 377) and/or RQALNLPDVFGLV (SEQ ID 
NO:379). Polnucleotides encoding these polypeptides are also encompassed by the 
10 invention. 

It has been discovered that this gene is expressed primarily in fetal spleen and 
liver as well as cd34 positive cells and to a lesser extent in several tissues suggesting a 
presence in blood or blood forming tissues. 

Therefore, nucleic acids of the invention are useful as reagents for differential 

15 identification of the tissue(s) or cell type(s) present in a biological sample and for 
diagnosis of the following diseases and conditions: developmental defects in the 
blood and blood forming cells. Similarly, polypeptides and antibodies directed to 
those polypeptides are useful to provide immunological probes for differential 
identification of the tissue(s) or cell type(s). For a number of disorders of the above 

20 tissues or cells, particularly of the immune system, expression of this gene at 

significantly higher or lower levels may be detected in certain tissues or cell types 
(e.g., fetal spleen and liver as well as cd34 positive cells, cancerous and wounded 
tissues) or bodily fluids (e.g., lymph, serum, plasma, urine, synovial fluid or spinal 
fluid) taken from an individual having such a disorder, relative to the standard gene 

25 expression level, i.e., the expression level in healthy tissue from an individual not 
having the disorder. 

Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 
197 as residues: Gln-54 to Gly-61, Asn-79 to Leu-91, Glu-99 to Thr-105, Pro-120 to 
Gin- 126, Pro- 128 to Phe-134, Arg-150 to Arg-156, Arg-160 to Arg-170. 

30 The tissue distribution in fetal spleen and liver as well as cd34 positive cells 

suggests that the protein product of this clone would be useful for treating disorders in 
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the development, proliferation, or regulation of blood forming cells including diseases 
such as lymphomas, granulomas, leukemias, and in the preservation and or 
replenishment of stem cells in the blood. 

Many polynucleotide sequences, such as EST sequences, are publicly 
5 available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO: 100 and may have been publicly available prior to conception 
of the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence 
would be cumbersome. Accordingly, preferably excluded from the present invention 
10 are one or more polynucleotides comprising a nucleotide sequence described by the 
general formula of a-b, where a is any integer between 1 to 1006 of SEQ ID NO: 100, 
b is an integer of 15 to 1020, where both a and b correspond to the positions of 
nucleotide residues shown in SEQ ID NO: 100, and where b is greater than or equal to 
a +14. 

15 

FEATURES OF PROTEIN ENCODED BY GENE NO: 91 

In specific embodiments, polypeptides of the invention comprise the following 
amino acid sequence: ATASHDLLLF (SEQ ID NO:379), MSINICLMQSKTQGSCQ 
YLLLPHPVPIILKVSTVFSLLSLFRLLFLSFCPHPKKCSYLLKYYGPLEGHKTLX 

20 YLRTNLGVIQPPLRMYAAEDCNGIG (SEQ ID NO:380), MSINICLMQSKTQG 
SCQYLLLPHPVPIILKVSTVFSLLSLFRLLFL (SEQ ID NO:381), and/or 
SFCPHPK KCSYLLKYYGPLEGHKTLXYLRTNLGVIQPPLRMYAAEDCNGIG 
(SEQ ID NO:382). Polynucleotides encoding these polypeptides are also 
encompassed by the invention. 

25 It has been discovered that this gene is expressed primarily in T cells, fetal 

heart and chronic lymphocytic leukemia and to a lesser extent in kidney, lung, and 16 
week embryos. 

Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the tissue(s) or cell type(s) present in a biological sample and for 
30 diagnosis of the following diseases and conditions: disorders of the blood including 
abnormalities in T cell function or blood cell proliferation such as leukemia . 
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Similarly, polypeptides and antibodies directed to those polypeptides are useful to 
provide immunological probes for differential identification of the tissue(s) or cell 
type(s). For a number of disorders of the above tissues or cells, particularly of the 
immune system, expression of this gene at significantly higher or lower levels may be 
detected in certain tissues or cell types (e.g., T cells, fetal heart and chronic 
lymphocytic leukemia, cancerous and wounded tissues) or bodily fluids (e.g., lymph, 
serum, plasma, urine, synovial fluid or spinal fluid) taken from an individual having 
such a disorder, relative to the standard gene expression level, i.e., the expression 
level in healthy tissue from an individual not having the disorder. 

Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 
198 as residues: Leu-45 to Val-50. 

The tissue distribution in T cells, fetal heart and chronic lymphocytic leukemia 
suggests that the protein product of this clone would be useful for treating 
abnormalities of the blood particularly those involving T-cells and the abnormal 
proliferation of blood cells such as lymphocytic leukemia. In addition, it suggests the 
protein product of this clone is useful for the diagnosis and treatment of a variety of 
immune system disorders. Morever, the expression of this gene product suggests a 
role in regulating the proliferation; survival; differentiation; and/or activation of 
hematopoietic cell lineages, including blood stem cells. This gene product may be 
involved in the regulation of cytokine production, antigen presentation, or other 
processes suggesting a usefulness in the treatment of cancer (e.g. by boosting immune 
responses). 

Since the gene is expressed in cells of lymphoid origin, the natural gene 
product may be involved in immune functions. Therefore it may be also used as an 
agent for immunological disorders including arthritis, asthma, immunodeficiency 
diseases such as AIDS, leukemia, rheumatoid arthritis, granulomatous disease, 
inflammatory bowel disease, sepsis, acne, neutropenia, neutrophilia, psoriasis, 
hypersensitivities, such as T-cell mediated cytotoxicity; immune reactions to 
transplanted organs and tissues, such as host-versus-graft and graft- versus-host 
diseases, or autoimmunity disorders, such as autoimmune infertility, lense tissue 
injury, demyelination, systemic lupus erythematosis, drug induced hemolytic anemia, 
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rheumatoid arthritis, Sjogren's disease, scleroderma and tissues. Moreover, the protein 
may represent a secreted factor that influences the differentiation or behavior of other 
blood cells, or that recruits hematopoietic cells to sites of injury. In addition, this gene 
product may have commercial utility in the expansion of stem cells and committed 
5 progenitors of various blood lineages, and in the differentiation and/or proliferation of 
various cell types. 

The expression in fetal heart tissue would suggest a useful role for the protein 
product in developmental abnormalities, fetal deficiencies, pre-natal disorders and 
variouswould-healing models and/or tissue trauma. The tissue distribution in kidney 

10 suggests the protein product of this clone could be used in the treatment and/or 

detection of kidney diseases including renal failure, nephritus, renal tubular.acidosis, 
proteinuria, pyuria, edema, pyelonephritis, hydronephritis, nephrotic syndrome, crush 
syndrome, glomerulonephritis, hematuria, renal colic and kidney stones, in addition to 
Wilm's Tumor Disease, and congenital kidney abnormalities such as horseshoe 

15 kidney, polycystic kidney, and Falconi's syndrome. 

In addition, the tissue distribution in embryonic tissue suggests the protein 
product of this clone is useful for the diagnosis, detection, and/or treatment of 
developmental disorders. Expression within embryonic tissue and other cellular 
sources marked by proliferating cells suggests this protein may play a role in the 

20 regulation of cellular division, and may show utility in the diagnosis and treatment of 
cancer and other proliferative disorders. Similarly, developmental tissues rely on 
decisions involving cell differentiation and/or apoptosis in pattern formation. 
Dysregulation of apoptosis can result in inappropriate suppression of cell death, as 
occurs in the development of some cancers, or in failure to control the extent of cell 

25 death, as is believed to occur in acquired immunodeficiency and certain 

neurodegenerative disorders, such as spinal muscular atrophy (SMA). Therefore, the 
polynucleotides and polypeptides of the present invention are useful in treating, 
detecting, and/or preventing said disorders and conditions, in addition to other types 
of degenerative conditions. Thus this protein may modulate apoptosis or tissue 

30 differentiation and would be useful in the detection, treatment, and/or prevention of 
degenerative or proliferative conditions and diseases. Protein, as well as, antibodies 
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directed against the protein may show utility as a tumor marker and/or 
immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
5 related to SEQ ID NO: 101 and may have been publicly available prior to conception 
of the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence 
would be cumbersome. Accordingly, preferably excluded from the present invention 
are one or more polynucleotides comprising a nucleotide sequence described by the 
10 general formula of a-b, where a is any integer between 1 to 1506 of SEQ ID NO:101, 
b is an integer of 15 to 1520, where both a and b correspond to the positions of 
nucleotide residues shown in SEQ ID NO: 101, and where b is greater than or equal to 
a +14. 

1 5 FEATURES OF PROTEIN ENCODED BY GENE NO: 92 

The translation product of this gene shares sequence homology with ctg4 
which is a glutamine repeat containing gene thought to be a candidate genetic disease 
locus. 

In specific embodiments, polypeptides of the invention comprise the sequence 
20 KEEDDDTERLPS KCE VCKLLSTE (SEQ ID NO:383 and 384) LQAELSRTGRSR 
EVLELGQ (SEQ ID NO:385 and 386), RQAVIVCRRRFV (SEQ ID NO:387), 
PPRWAHPKAPEGSPDPPSPPSALGLSVLPWSDSDPWHISVSPCAQREHYSPGS 
AHINSLRPLPALSLKRCKARVSSSCLYPAPAPAPAPLEIDRCDSVPPVALCSAA 
YTLRICWASVLCHRPPPSTSQPKPRARPKKGKAIFPTAQVP (SEQ ID NO:388), 
25 PPRWAHPKAPEGSPDPPSPPSALGLSVLPWSDSDPWHISVSPCAQREHYSPGS 
AHINSLRPLPALSLKRCK (SEQ ID NO:389), and/or ARVSSSCLYPAPAPAPAPL 
EIDRCDSVPPVALCSAAYTLRICWASVLCHRPPPSTSQPKPRARPKKGKAIFPT 
AQVP (SEQ ID NO:390). Polynucleotides encoding these polypeptides are also 
encompassed by the invention. 
30 It has been discovered that this gene is expressed in several tissues including 

lung, heart, kidney, adrenal gland, smooth muscle, cerebellum, and embryonic tissue. 
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Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the tissue(s) or cell type(s) present in a biological sample and for 
diagnosis of the following diseases and conditions: inherited developmental disorders 
possibly with a neuropsychiatry component. Similarly, polypeptides and antibodies 
directed to those polypeptides are useful to provide immunological probes for 
differential identification of the tissue(s) or cell type(s). For a number of disorders of 
the above tissues or cells, particularly of the nervous system, expression of this gene 
at significantly higher or lower levels may be detected in certain tissues or cell types 
(e.g., cancerous and wounded tissues) or bodily fluids (e.g., lymph, serum, plasma, 
urine, synovial fluid or spinal fluid) taken from an individual having such a disorder, 
relative to the standard gene expression level, i.e., the expression level in healthy 
tissue from an individual not having the disorder. 

Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 
199 as residues: Lys-25 to Ser-36, Ser-53 to Glu-60, Thr-70 to Arg-75, Arg-1 1 1 to 
Thr-119, Glu-161 to Leu- 189. 

The tissue distribution and homology to glutamine repeat family member 
CTG4 suggests that the protein product of this clone would be useful for identifying 
and treating specific diseases related to nucleotide triplet expansion. The tissue 
distribution in embryonic tissue suggests the protein product of this clone is useful for 
the diagnosis, detection, and/or treatment of developmental disorders. The relatively 
specific expression of this gene product during embryogenesis suggests it may be a 
key player in the proliferation, maintenance, and/or differentiation of various cell 
types during development. It may also act as a morphogen to control cell and tissue 
type specification. Because of potential roles in proliferation and differentiation, this 
gene product may have applications in the adult for tissue regeneration and the 
treatment of cancers. Expression within embryonic tissue and other cellular sources 
marked by proliferating cells suggests this protein may play a role in the regulation of 
cellular division, and may show utility in the diagnosis and treatment of cancer and 
other proliferative disorders. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
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related to SEQ ID NO: 102 and may have been publicly available prior to conception 
of the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence 
would be cumbersome. Accordingly, preferably excluded from the present invention 
are one or more polynucleotides comprising a nucleotide sequence described by the 
general formula of a-b, where a is any integer between 1 to 1292 of SEQ ID NO: 102, 
b is an integer of 15 to 1306, where both a and b correspond to the positions of 
nucleotide residues shown in SEQ ID NO: 102, and where b is greater than or equal to 
a +14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 93 

In specific embodiments, polypeptides of the invention comprise the following 
amino acid sequence: EEKLFTSAPGRDFWVMGETRDGNEEN (SEQ ID NO:391). 
Polynucleotides encoding these polypeptides are also encompassed by the invention. 
The gene encoding the disclosed cDNA is believed to reside on chromosome 16. 
Accordingly, polynucleotides related to this invention are useful as a marker in 
linkage analysis for chromosome 16. 

It has been discovered that this gene is expressed primarily in cancerous and 
fetal tissue. 

Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the tissue(s) or cell type(s) present in a biological sample and for 
diagnosis of the following diseases and conditions: cancer, developmental anomalies 
or fetal deficiencies. Similarly, polypeptides and antibodies directed to those 
polypeptides are useful to provide immunological probes for differential identification 
of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells, 
particularly of the reproductive system and developing fetus, expression of this gene 
at significantly higher or lower levels may be detected in certain tissues or cell types 
(e.g., developmental, reproductive, and cancerous and wounded tissues) or bodily 
fluids (e.g., lymph, amniotic fluid, serum, plasma, urine, synovial fluid or spinal fluid) 
taken from an individual having such a disorder, relative to the standard gene 
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expression level, i.e., the expression level in healthy tissue from an individual not 
having the disorder. 

Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 
200 as residues: Met-1 to Ser-6. 
5 The tissue distribution in fetal tissue suggests that the protein product of this 

clone would be useful for the treatment and diagnosis of developmental anomalies or 
fetal deficiencies. In addition to fetal tissue, expression in a variety of cancerous 
tissues suggests a role in the treatment and diagnosis of uncontrolled cell proliferation 
and/or differentiation (e.g. cancer). Moreover, the expression within embryonic tissue 

10 and other cellular sources marked by proliferating cells suggests this protein may play 
a role in the regulation of cellular division, and may show utility in the diagnosis and 
treatment of cancer and other proliferative disorders. 

Similarly, developmental tissues rely on decisions involving cell 
differentiation and/or apoptosis in pattern formation. Dysregulation of apoptosis can 

15 result in inappropriate suppression of cell death, as occurs in the development of some 
cancers, or in failure to control the extent of cell death, as is believed to occur in 
acquired immunodeficiency and certain neurodegenerative disorders, such as spinal 
muscular atrophy (SMA). Therefore, the polynucleotides and polypeptides of the 
present invention are useful in treating, detecting, and/or preventing said disorders 

20 and conditions, in addition to other types of degenerative conditions. Thus this protein 
may modulate apoptosis or tissue differentiation and would be useful in the detection, 
treatment, and/or prevention of degenerative or proliferative conditions and diseases. 
Protein, as well as, antibodies directed against the protein may show utility as a tumor 
marker and/or immunotherapy targets for the above listed tissues. 

25 Many polynucleotide sequences, such as EST sequences, are publicly 

available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO: 103 and may have been publicly available prior to conception 
of the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence 

30 would be cumbersome. Accordingly, preferably excluded from the present invention 
are one or more polynucleotides comprising a nucleotide sequence described by the 
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general formula of a-b, where a is any integer between 1 to 771 of SEQ ID NO: 103, b 
is an integer of 15 to 785, where both a and b correspond to the positions of 
nucleotide residues shown in SEQ ID NO: 103, and where b is greater than or equal to 
a+ 14. 

5 

FEATURES OF PROTEIN ENCODED BY GENE NO: 94 

The gene encoding the disclosed cDNA is believed to reside on chromosome 
10. Accordingly, polynucleotides related to this invention are useful as a marker in 
linkage analysis for chromosome 10. 
10 This gene is expressed primarily in hypothalamus, T-cells, and adipose tissue. 

Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the tissue(s) or cell type(s) present in a biological sample and for 
diagnosis of the following diseases and conditions: immune (e.g. immunodeficiencies, 
autoimmunities, inflammation, leukemias & lymphomas) and neurological (e.g. 
15 Alzheimer's disease, dementia, schizophrenia) disorders. Similarly, polypeptides and 
antibodies directed to those polypeptides are useful to provide immunological probes 
for differential identification of the tissue(s) or cell type(s). For a number of disorders 
of the above tissues or cells, particularly of the central nervous, hematopoietic and 
immune systems, expression of this gene at significantly higher or lower levels may 
20 be detected in certain tissues (e.g., immune, neural, metabolic, and cancerous and 

wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal 
fluid) taken from an individual having such a disorder, relative to the standard gene 
expression level, i.e., the expression level in healthy tissue from an individual not 
having the disorder. The tissue distribution suggests that the protein product of this 
25 clone would be useful in the intervention or detection of pathologies associated with 
the hematopoietic and immune systems, such as anemias (leukemias). In addition, the 
expression in brain (including fetal) might suggest a role in developmental brain 
defects, neuro-degenerative diseases or behavioral abnomalities (e.g. schizophrenia, 
Alzheimer's, dementia, depression, etc.). 
30 Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 

201 as residues: Phe-64 to Gly-77, Pro-83 to Asp-99. 
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The tissue distribution in hypothalamus suggests the protein product of this 
clone is useful for the detection, treatment, and/or prevention of neurodegenerative 
disease states, behavioral disorders, or inflammatory conditions which include, but 
are not limited to Alzheimer's Disease, Parkinson's Disease, Huntington's Disease, 
5 Tourette Syndrome, meningitis, encephalitis, demyelinating diseases, peripheral 
neuropathies, neoplasia, trauma, congenital malformations, spinal cord injuries, 
ischemia and infarction, aneurysms, hemorrhages, schizophrenia, mania, dementia, 
paranoia, obsessive compulsive disorder, depression, panic disorder, learning 
disabilities, ALS, psychoses, autism, and altered behaviors, including disorders in 

10 feeding, sleep patterns, balance, and perception. In addition, elevated expression of 
this gene product in regions of the brain suggests it plays a role in normal neural 
function. Potentially, this gene product is involved in synapse formation, 
neurotransmission, learning, cognition, homeostasis, or neuronal differentiation or 
survival. This gene product may be involved in the regulation of cytokine production, 

15 antigen presentation, or other processes suggesting a usefulness in the treatment of 
cancer (e.g. by boosting immune responses). 

Since the gene is expressed in cells of lymphoid origin, the natural gene 
product may be involved in immune functions. Therefore it may be also used as an 
agent for immunological disorders including arthritis, asthma, immunodeficiency 

20 diseases such as AIDS, leukemia, rheumatoid arthritis, granulomatous disease, 
inflammatory bowel disease, sepsis, acne, neutropenia, neutrophilia, psoriasis, 
hypersensitivities, such as T-cell mediated cytotoxicity; immune reactions to 
transplanted organs and tissues, such as host-versus-graft and graft-versus-host 
diseases, or autoimmunity disorders, such as autoimmune infertility, lense tissue 

25 injury, demyelination, systemic lupus erythematosis, drug induced hemolytic anemia, 
rheumatoid arthritis, Sjogren's disease, scleroderma and tissues. 

Moreover, the protein may represent a secreted factor that influences the 
differentiation or behavior of other blood cells, or that recruits hematopoietic cells to 
sites of injury. In addition, this gene product may have commercial utility in the 

30 expansion of stem cells and committed progenitors of various blood lineages, and in 
the differentiation and/or proliferation of various cell types. Moreover, the protein 
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product of this clone is useful for the diagnosis, prevention, and/or treatment of 
various metabolic disorders which include, but are not limited to, Tay-Sachs disease, 
phenylkenonuria, galactosemia, hyperlipidemias, porphyrias, and Hurler's syndrome. 
The protein is useful in the treatment and/or prevention of neurodegenerative 
conditions, particularly those which occur secondary to aberrant fatty acid 
metabolism (i.e. defects which affect the synthesis and integrity of the myelin sheath). 
Protein, as well as, antibodies directed against the protein may show utility as a tumor 
marker and/or immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
related to SEQ ID NO: 104 and may have been publicly available prior to conception 
of the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence 
would be cumbersome. Accordingly, preferably excluded from the present invention 
are one or more polynucleotides comprising a nucleotide sequence described by the 
general formula of a-b, where a is any integer between 1 to 2001 of SEQ ID NO: 104, 
b is an integer of 15 to 2015, where both a and b correspond to the positions of 
nucleotide residues shown in SEQ ID NO: 104, and where b is greater than or equal to 
a+ 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 95 

The translation product of this gene was shown to have homology to the 
murine leucine-rich repeat protein (See Genbank Accession No. gil2880079), which is 
thought to be important in neural development. 

In specific embodiments, the polypeptides of the invention comprise the 
sequence:QKPTFALGELYPPLINLWEAGKEKSTSLKVKATVIGLPTNMS (SEQ 
ID NO: 392). Polynucleotides encoding this polypeptide are also encompassed by the 
invention. The gene encoding the disclosed cDNA is believed to reside on 
chromosome 7. Accordingly, polynucleotides related to this invention are useful as 
a marker in linkage analysis for chromosome 7. 
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It has been discovered that this gene is expressed primarily in T-cells and 

brain. 

Therefore, nucleic acids of the invention are useful as reagents for differential 
identification of the tissue(s) or cell type(s) present in a biological sample and for 

5 diagnosis of the following diseases and conditions: immunodeficiency, tumor 
necrosis, infection, lymphomas, auto-immunities, cancer, inflammation, anemias 
(leukemia) and other hematopoeitic disorders, neurological diseases of the brain such 
as depression, schizophrenia, Alzheimer's disease, Parkinson's disease, Huntington's 
disease, dementia and specific brain tumors. Similarly, polypeptides and antibodies 

10 directed to those polypeptides are useful to provide immunological probes for 

differential identification of the tissue(s) or cell type(s). For a number of disorders of 
the above tissues or cells, particularly of the brain and immune system, expression of 
this gene at significantly higher or lower levels may be detected in certain tissues or 
cell types (e.g., neural, immune, hematopoietic, and cancerous and wounded tissues) 

15 or bodily fluids (e.g., lymph, amniotic fluid, serum, plasma, urine, synovial fluid or 
spinal fluid) taken from an individual having such a disorder, relative to the standard 
gene expression level, i.e., the expression level in healthy tissue from an individual 
not having the disorder. 

Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 

20 202 as residues: Met-24 to Gly-29, Ala-57 to Thr-63. 

The tissue distribution in T-cells suggests that the protein product of this clone 
would be useful for the diagnosis and treatment of immune disorders including: 
leukemias, lymphomas, auto-immunities, immunodeficiencies (e.g. AIDS), immuno- 
supressive conditions (transplantation) and hematopoeitic disorders. In addition this 

25 gene product may be applicable in conditions of general microbial infection, 

inflammation or cancer. The expression in brain, combined with the homology to the 
leucine-rich repeat protein suggests that the protein product of this clone would be 
useful for the treatment and diagnosis of developmental, degenerative and behavioral 
conditions of the brain and nervous system, such as depression, schizophrenia, 

30 Alzheimer's disease, Parkinson's disease, Huntington's disease, Tourette Syndrome, 
mania, dementia, paranoia, addictive behavior, obsessive-compulsisve disorder and 
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sleep disorders. Protein, as well as, antibodies directed against the protein may show 
utility as a tumor marker and/or immunotherapy targets for the above listed tissues. 

Many polynucleotide sequences, such as EST sequences, are publicly 
available and accessible through sequence databases. Some of these sequences are 
5 related to SEQ It) NO: 105 and may have been publicly available prior to conception 
of the present invention. Preferably, such related polynucleotides are specifically 
excluded from the scope of the present invention. To list every related sequence 
would be cumbersome. Accordingly, preferably excluded from the present invention 
are one or more polynucleotides comprising a nucleotide sequence described by the 
10 general formula of a-b, where a is any integer between 1 to 353 of SEQ ID NO: 105, b 
is an integer of 15 to 367, where both a and b correspond to the positions of 
nucleotide residues shown in SEQ ID NO: 105, and where b is greater than or equal to 
a + 14. 
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Table 1 summarizes the information corresponding to each "Gene No." described 
above. The nucleotide sequence identified as "NT SEQ ID NO:X" was assembled 
from partially homologous ("overlapping") sequences obtained from the "cDNA 
clone ID" identified in Table 1 and, in some cases, from additional related DNA 
5 clones. The overlapping sequences were assembled into a single contiguous sequence 
of high redundancy (usually three to five overlapping sequences at each nucleotide 
position), resulting in a final sequence identified as SEQ ID NO:X. 

The cDNA Clone ID was deposited on the date and given the corresponding 
deposit number listed in "ATCC Deposit No:Z and Date." Some of the deposits 
10 contain multiple different clones corresponding to the same gene. "Vector" refers to 
the type of vector contained in the cDNA Clone ID. 

"Total NT Seq." refers to the total number of nucleotides in the contig 
identified by "Gene No." The deposited clone may contain all or most of these 
sequences, reflected by the nucleotide position indicated as "5* NT of Clone Seq." 
15 and the "3* NT of Clone Seq." of SEQ ID NO:X. The nucleotide position of SEQ ID 
NO:X of the putative start codon (methionine) is identified as "5' NT of Start Codon." 
Similarly , the nucleotide position of SEQ ID NO:X of the predicted signal sequence 
is identified as "5' NT of First AA of Signal Pep." 

The translated amino acid sequence, beginning with the methionine, is 
20 identified as "AA SEQ ID NO: Y," although other reading frames can also be easily 
translated using known molecular biology techniques. The polypeptides produced by 
these alternative open reading frames are specifically contemplated by the present 
invention. 

The first and last amino acid position of SEQ ID NO: Y of the predicted signal 
25 peptide is identified as "First AA of Sig Pep" and "Last AA of Sig Pep." The 
predicted first amino acid position of SEQ ID NO:Y of the secreted portion is 
identified as "Predicted First AA of Secreted Portion." Finally, the amino acid 
position of SEQ ID NO: Y of the last amino acid in the open reading frame is 
identified as "Last AA of ORF." 
30 SEQ ID NO:X and the translated SEQ ID NO: Y are sufficiently accurate and 

otherwise suitable for a variety of uses well known in the art and described further 



BNSDOCID: <WO 9947540A1 J_> 



WO 99/47540 



PCT/US99/05804 



190 

below. For instance, SEQ ID NO:X is useful for designing nucleic acid hybridization 
probes that will detect nucleic acid sequences contained in SEQ ID NO:X or the 
cDNA contained in the deposited clone. These probes will also hybridize to nucleic 
acid molecules in biological samples, thereby enabling a variety of forensic and 
diagnostic methods of the invention. Similarly, polypeptides identified from SEQ ID 
NO:Y may be used to generate antibodies which bind specifically to the secreted 
proteins encoded by the cDNA clones identified in Table 1 . 

Nevertheless, DNA sequences generated by sequencing reactions can contain 
sequencing errors. The errors exist as misidentified nucleotides, or as insertions or 
deletions of nucleotides in the generated DNA sequence. The erroneously inserted or 
deleted nucleotides cause frame shifts in the reading frames of the predicted amino 
acid sequence. In these cases, the predicted amino acid sequence diverges from the 
actual amino acid sequence, even though the generated DNA sequence may be greater 
than 99.9% identical to the actual DNA sequence (for example, one base insertion or 
deletion in an open reading frame of over 1000 bases). 

Accordingly, for those applications requiring precision in the nucleotide 
sequence or the amino acid sequence, the present invention provides not only the 
generated nucleotide sequence identified as SEQ ID NO:X and the predicted 
translated amino acid sequence identified as SEQ ID NO: Y, but also a sample of 
plasmid DNA containing a human cDNA of the invention deposited with the ATCC, 
as set forth in Table 1 . The nucleotide sequence of each deposited clone can readily 
be determined by sequencing the deposited clone in accordance with known methods. 
The predicted amino acid sequence can then be verified from such deposits. 
Moreover, the amino acid sequence of the protein encoded by a particular clone can 
also be directly determined by peptide sequencing or by expressing the protein in a 
suitable host cell containing the deposited human cDNA, collecting the protein, and 
determining its sequence. 

The present invention also relates to the genes corresponding to SEQ ID 
NO:X, SEQ ID NO: Y, or the deposited clone. The corresponding gene can be 
isolated in accordance with known methods using the sequence information disclosed 
herein. Such methods include preparing probes or primers from the disclosed 
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sequence and identifying or amplifying the corresponding gene from appropriate 
sources of genomic material. 

Also provided in the present invention are species homologs. Species 
homologs may be isolated and identified by making suitable probes or primers from 
5 the sequences provided herein and screening a suitable nucleic acid source for the 
desired homologue. 

The polypeptides of the invention can be prepared in any suitable manner. 
Such polypeptides include isolated naturally occurring polypeptides, recombinantly 
produced polypeptides, synthetically produced polypeptides, or polypeptides 
10 produced by a combination of these methods. Means for preparing such polypeptides 
are well understood in the art. 

The polypeptides may be in the form of the secreted protein, including the 
mature form, or may be a part of a larger protein, such as a fusion protein (see below). 
It is often advantageous to include an additional amino acid sequence which contains 
15 secretory or leader sequences, pro-sequences, sequences which aid in purification , 
such as multiple histidine residues, or an additional sequence for stability during 
recombinant production. 

The polypeptides of the present invention are preferably provided in an 
isolated form, and preferably are substantially purified. A recombinantly produced 
20 version of a polypeptide, including the secreted polypeptide, can be substantially 
purified by the one-step method described in Smith and Johnson, Gene 67:31-40 
(1988). Polypeptides of the invention also can be purified from natural or 
recombinant sources using antibodies of the invention raised against the secreted 
protein in methods which are well known in the art. 

25 

Signal Sequences 

Methods for predicting whether a protein has a signal sequence, as well as the 
cleavage point for that sequence, are available. For instance, the method of 
McGeoch, Virus Res. 3:271-286 (1985), uses the information from a short N-terminal 
30 charged region and a subsequent uncharged region of the complete (uncleaved) 

protein. The method of von Heinje, Nucleic Acids Res. 14:4683-4690 (1986) uses the 
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information from the residues surrounding the cleavage site, typically residues -13 to 
+2, where +1 indicates the amino terminus of the secreted protein. The accuracy of 
predicting the cleavage points of known mammalian secretory proteins for each of 
these methods is in the range of 75-80%. (von Heinje, supra.) However, the two 
methods do not always produce the same predicted cleavage point(s) for a given 
protein. 

In the present case, the deduced amino acid sequence of the secreted 
polypeptide was analyzed by a computer program called SignalP (Henrik Nielsen et 
al M Protein Engineering 10:1-6 (1997)), which predicts the cellular location of a 
protein based on the amino acid sequence. As part of this computational prediction of 
localization, the methods of McGeoch and von Heinje are incorporated. The analysis 
of the amino acid sequences of the secreted proteins described herein by this program 
provided the results shown in Table 1. 

As one of ordinary skill would appreciate, however, cleavage sites sometimes 
vary from organism to organism and cannot be predicted with absolute certainty. 
Accordingly, the present invention provides secreted polypeptides having a sequence 
shown in SEQ ID NO: Y which have an N-terminus beginning within 5 residues (i.e., 
+ or - 5 residues) of the predicted cleavage point. Similarly, it is also recognized that 
in some cases, cleavage of the signal sequence from a secreted protein is not entirely 
uniform, resulting in more than one secreted species. These polypeptides, and the 
polynucleotides encoding such polypeptides, are contemplated by the present 
invention. 

Moreover, the signal sequence identified by the above analysis may not 
necessarily predict the naturally occurring signal sequence. For example, the 
naturally occurring signal sequence may be further upstream from the predicted signal 
sequence. However, it is likely that the predicted signal sequence will be capable of 
directing the secreted protein to the ER. These polypeptides, and the polynucleotides 
encoding such polypeptides, are contemplated by the present invention. 

Polynucleotide and Polypeptide Variants 
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"Variant" refers to a polynucleotide or polypeptide differing from the 
polynucleotide or polypeptide of the present invention, but retaining essential 
properties thereof. Generally, variants are overall closely similar, and, in many 
regions, identical to the polynucleotide or polypeptide of the present invention. 
5 By a polynucleotide having a nucleotide sequence at least, for example, 95% 

"identical" to a reference nucleotide sequence of the present invention, it is intended 
that the nucleotide sequence of the polynucleotide is identical to the reference 
sequence except that the polynucleotide sequence may include up to five point 
mutations per each 100 nucleotides of the reference nucleotide sequence encoding the 

10 polypeptide. In other words, to obtain a polynucleotide having a nucleotide sequence 
at least 95% identical to a reference nucleotide sequence, up to 5% of the nucleotides 
in the reference sequence may be deleted or substituted with another nucleotide, or a 
number of nucleotides up to 5% of the total nucleotides in the reference sequence may 
be inserted into the reference sequence. The query sequence may be an entire 

15 sequence shown inTable 1, the ORF (open reading frame), or any fragement specified 
as described herein. 

As a practical matter, whether any particular nucleic acid molecule or 
polypeptide is at least 90%, 95%, 96%, 97%, 98% or 99% identical to a nucleotide 
sequence of the presence invention can be determined conventionally using known 

20 computer programs. A preferred method for determing the best overall match 
between a query sequence (a sequence of the present invention) and a subject 
sequence, also referred to as a global sequence alignment, can be determined using 
the FASTDB computer program based on the algorithm of Brutlag et al. (Comp. App. 
Biosci. (1990) 6:237-245). In a sequence alignment the query and subject sequences 

25 are both DNA sequences. An RNA sequence can be compared by converting U's to 
T's. The result of said global sequence alignment is in percent identity. Preferred 
parameters used in a FASTDB alignment of DNA sequences to calculate percent 
identiy are: Matrix=Unitary, k-tuple=4, Mismatch Penalty=l, Joining Penalty=30, 
Randomization Group Length=0, Cutoff Score=l, Gap Penalty=5, Gap Size Penalty 

30 0.05, Window Size=500 or the lenght of the subject nucleotide sequence, whichever is 
shorter. 
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If the subject sequence is shorter than the query sequence because of 5* or 3' 
deletions, not because of internal deletions, a manual correction must be made to the 
results. This is because the FASTDB program does not account for 5' and 3' 
truncations of the subject sequence when calculating percent identity. For subject 
5 sequences truncated at the 5' or 3' ends, relative to the the query sequence, the 

percent identity is corrected by calculating the number of bases of the query sequence 
that are 5' and 3' of the subject sequence, which are not matched/aligned, as a percent 
of the total bases of the query sequence. Whether a nucleotide is matched/aligned is 
determined by results of the FASTDB sequence alignment. This percentage is then 

10 subtracted from the percent identity, calculated by the above FASTDB program using 
the specified parameters, to arrive at a final percent identity score. This corrected 
score is what is used for the purposes of the present invention. Only bases outside the 
5* and 3' bases of the subject sequence, as displayed by the FASTDB alignment, 
which are not matched/aligned with the query sequence, are calculated for the 

15 purposes of manually adjusting the percent identity score. 

For example, a 90 base subject sequence is aligned to a 100 base query 
sequence to determine percent identity. The deletions occur at the 5' end of the 
subject sequence and therefore, the FASTDB alignment does not show a 
matched/alignement of the first 10 bases at 5' end. The 10 unpaired bases represent 

20 10% of the sequence (number of bases at the 5' and 3' ends not matched/total number 
of bases in the query sequence) so 10% is subtracted from the percent identity score 
calculated by the FASTDB program. If the remaining 90 bases were perfectly 
matched the final percent identity would be 90%. In another example, a 90 base 
subject sequence is compared with a 100 base query sequence. This time the 

25 deletions are internal deletions so that there are no bases on the 5' or 3' of the subject 
sequence which are not matched/aligned with the query. In this case the percent 
identity calculated by FASTDB is not manually corrected. Once again, only bases 5' 
and 3' of the subject sequence which are not matched/aligned with the query sequnce 
are manually corrected for. No other manual corrections are to made for the purposes 

30 of the present invention. 

By a polypeptide having an amino acid sequence at least, for example, 95% 
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"identical" to a query amino acid sequence of the present invention, it is intended that 
the amino acid sequence of the subject polypeptide is identical to the query sequence 
except that the subject polypeptide sequence may include up to five amino acid 
alterations per each 100 amino acids of the query amino acid sequence. In other 
5 words, to obtain a polypeptide having an amino acid sequence at least 95% identical 
to a query amino acid sequence, up to 5% of the amino acid residues in the subject 
sequence may be inserted, deleted, (indels) or substituted with another amino acid. 
These alterations of the reference sequence may occur at the amino or carboxy 
terminal positions of the reference amino acid sequence or anywhere between those 

10 terminal positions, interspersed either individually among residues in the reference 
sequence or in one or more contiguous groups within the reference sequence. 

As a practical matter, whether any particular polypeptide is at least 90%, 95%, 
96%, 97%, 98% or 99% identical to, for instance, the amino acid sequences shown in 
Table 1 or to the amino acid sequence encoded by deposited DNA clone can be 

15 determined conventionally using known computer programs. A preferred method for 
determing the best overall match between a query sequence (a sequence of the present 
invention) and a subject sequence, also referred to as a global sequence alignment, 
can be determined using the FASTDB computer program based on the algorithm of 
Brutlag et al. (Comp. App. Biosci. (1990) 6:237-245). In a sequence alignment the 

20 query and subject sequences are either both nucleotide sequences or both amino acid 
sequences. The result of said global sequence alignment is in percent identity. 
Preferred parameters used in a FASTDB amino acid alignment are: Matrix=PAM 0, 
k-tuple=2, Mismatch Penalty=l, Joining Penalty=20, Randomization Group 
Length=0, Cutoff Score=l, Window Size=sequence length, Gap Penalty=5, Gap Size 

25 Penalty=0.05, Window Size=500 or the length of the subject amino acid sequence, 
whichever is shorter. 

If the subject sequence is shorter than the query sequence due to N- or C- 
terminal deletions, not because of internal deletions, a manual correction must be 
made to the results. This is becuase the FASTDB program does not account for N- 

30 and C-terminal truncations of the subject sequence when calculating global percent 
identity. For subject sequences truncated at the N- and C-tennini, relative to the the 
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query sequence, the percent identity is corrected by calculating the number of residues 
of the query sequence that are N- and C-terminal of the subject sequence, which are 
not matched/aligned with a corresponding subject residue, as a percent of the total 
bases of the query sequence. Whether a residue is matched/aligned is determined by 
5 results of the FASTDB sequence alignment. This percentage is then subtracted from 
the percent identity, calculated by the above FASTDB program using the specified 
parameters, to arrive at a final percent identity score. This final percent identity score 
is what is used for the purposes of the present invention. Only residues to the N- and 
C-termini of the subject sequence, which are not matched/aligned with the query 
10 sequence, are considered for the purposes of manually adjusting the percent identity 
score. That is, only query residue positions outside the farthest N- and C-terminal 
residues of the subject sequence. 

For example, a 90 amino acid residue subject sequence is aligned with a 100 
residue query sequence to determine percent identity. The deletion occurs at the N- 

15 terminus of the subject sequence and therefore, the FASTDB alignment does not 
show a matching/alignment of the first 10 residues at the N-terminus. The 10 
unpaired residues represent 10% of the sequence (number of residues at the N- and C- 
termini not matched/total number of residues in the query sequence) so 10% is 
subtracted from the percent identity score calculated by the FASTDB program. If the 

20 remaining 90 residues were perfectly matched the final percent identity would be 
90%. In another example, a 90 residue subject sequence is compared with a 100 
residue query sequence. This time the deletions are internal deletions so there are no 
residues at the N- or C-termini of the subject sequence which are not matched/aligned 
with the query. In this case the percent identity calculated by FASTDB is not 

25 manually corrected. Once again, only residue positions outside the N- and C-terminal 
ends of the subject sequence, as displayed in the FASTDB alignment, which are not 
matched/aligned with the query sequnce are manually corrected for. No other manual 
corrections are to made for the purposes of the present invention. 

The variants may contain alterations in the coding regions, non-coding 

30 regions, or both. Especially preferred are polynucleotide variants containing 

alterations which produce silent substitutions, additions, or deletions, but do not alter 
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the properties or activities of the encoded polypeptide. Nucleotide variants produced 
by silent substitutions due to the degeneracy of the genetic code are preferred. 
Moreover, variants in which 5-10, 1-5, or 1-2 amino acids are substituted, deleted, or 
added in any combination are also preferred. Polynucleotide variants can be produced 
for a variety of reasons, e.g., to optimize codon expression for a particular host 
(change codons in the human mRNA to those preferred by a bacterial host such as E. 
coli). 

Naturally occurring variants are called "allelic variants," and refer to one of 
several alternate forms of a gene occupying a given locus on a chromosome of an 
organism. (Genes II, Lewin, B., ed., John Wiley & Sons, New York (1985).) These 
allelic variants can vary at either the polynucleotide and/or polypeptide level. 
Alternatively, non-naturally occurring variants may be produced by mutagenesis 
techniques or by direct synthesis. 

Using known methods of protein engineering and recombinant DNA 
technology, variants may be generated to improve or alter the characteristics of the 
polypeptides of the present invention. For instance, one or more amino acids can be 
deleted from the N-terminus or C-terminus of the secreted protein without substantial 
loss of biological function. The authors of Ron et aL, J. Biol. Chem. 268: 2984-2988 
(1993), reported variant KGF proteins having heparin binding activity even after 
deleting 3, 8, or 27 amino-terminal amino acid residues. Similarly, Interferon gamma 
exhibited up to ten times higher activity after deleting 8-10 amino acid residues from 
the carboxy terminus of this protein. (Dobeli et al., J. Biotechnology 1\ 199-216 
(1988).) 

Moreover, ample evidence demonstrates that variants often retain a biological 
activity similar to that of the naturally occurring protein. For example, Gayle and 
coworkers (J. Biol. Chem 268:22105-221 1 1 (1993)) conducted extensive mutational 
analysis of human cytokine IL-la. They used random mutagenesis to generate over 
3,500 individual IL-la mutants that averaged 2.5 amino acid changes per variant over 
the entire length of the molecule. Multiple mutations were examined at every 
possible amino acid position. The investigators found that "[m]ost of the molecule 
could be altered with little effect on either [binding or biological activity]." (See, 
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Abstract.) In fact, only 23 unique amino acid sequences, out of more than 3,500 
nucleotide sequences examined, produced a protein that significantly differed in 
activity from wild-type. 

Furthermore, even if deleting one or more amino acids from the N-terminus or 
C-terminus of a polypeptide results in modification or loss of one or more biological 
functions, other biological activities may still be retained. For example, the ability of 
a deletion variant to induce and/or to bind antibodies which recognize the secreted 
form will likely be retained when less than the majority of the residues of the secreted 
form are removed from the N-terminus or C-terminus. Whether a particular 
polypeptide lacking N- or C-terminal residues of a protein retains such immunogenic 
activities can readily be determined by routine methods described herein and 
otherwise known in the art. 

Thus, the invention further includes polypeptide variants which show 
substantial biological activity. Such variants include deletions, insertions, 
inversions, repeats, and substitutions selected according to general rules known in the 
art so as have little effect on activity. For example, guidance concerning how to make 
phenotypically silent amino acid substitutions is provided in Bowie, J. U. et al., 
Science 247:1306-1310 (1990), wherein the authors indicate that there are two main 
strategies for studying the tolerance of an amino acid sequence to change. 

The first strategy exploits the tolerance of amino acid substitutions by natural 
selection during the process of evolution. By comparing amino acid sequences in 
different species, conserved amino acids can be identified. These conserved amino 
acids are likely important for protein function. In contrast, the amino acid positions 
where substitutions have been tolerated by natural selection indicates that these 
positions are not critical for protein function. Thus, positions tolerating amino acid 
substitution could be modified while still maintaining biological activity of the 
protein. 

The second strategy uses genetic engineering to introduce amino acid changes 
at specific positions of a cloned gene to identify regions critical for protein function. 
For example, site directed mutagenesis or alanine-scanning mutagenesis (introduction 
of single alanine mutations at every residue in the molecule) can be used. 
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(Cunningham and Wells, Science 244:1081-1085 (1989).) The resulting mutant 
molecules can then be tested for biological activity. 

As the authors state, these two strategies have revealed that proteins are 
surprisingly tolerant of amino acid substitutions. The authors further indicate which 
5 amino acid changes are likely to be permissive at certain amino acid positions in the 
protein. For example, most buried (within the tertiary structure of the protein) amino 
acid residues require nonpolar side chains, whereas few features of surface side chains 
are generally conserved. Moreover, tolerated conservative amino acid substitutions 
* involve replacement of the aliphatic or hydrophobic amino acids Ala, Val, Leu and 
10 He; replacement of the hydroxyl residues Ser and Thr; replacement of the acidic 

residues Asp and Glu; replacement of the amide residues Asn and Gin, replacement of 
the basic residues Lys, Arg, and His; replacement of the aromatic residues Phe, Tyr, 
and Trp, and replacement of the small-sized amino acids Ala, Ser, Thr, Met, and Gly. 

15 Besides conservative amino acid substitution, variants of the present invention 

include (i) substitutions with one or more of the non-conserved amino acid residues, 
where the substituted amino acid residues may or may not be one encoded by the 
genetic code, or (ii) substitution with one or more of amino acid residues having a 
substituent group, or (iii) fusion of the mature polypeptide with another compound, 

20 such as a compound to increase the stability and/or solubility of the polypeptide (for 
example, polyethylene glycol), or (iv) fusion of the polypeptide with additional amino 
acids, such as an IgG Fc fusion region peptide, or leader or secretory sequence, or a 
sequence facilitating purification. Such variant polypeptides are deemed to be within 
the scope of those skilled in the art from the teachings herein. 

25 For example, polypeptide variants containing amino acid substitutions of 

charged amino acids with other charged or neutral amino acids may produce proteins 
with improved characteristics, such as less aggregation. Aggregation of 
pharmaceutical formulations both reduces activity and increases clearance due to the 
aggregate's immunogenic activity. (Pinckard et al., Clin. Exp. Immunol. 2:331-340 

30 (1967); Robbins et al., Diabetes 36: 838-845 (1987); Cleland et al., Crit. Rev. 
Therapeutic Drug Carrier Systems 10:307-377 (1993).) 
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A further embodiment of the invention relates to a polypeptide which 
comprises the amino acid sequence of the present invention having an amino acid 
sequence which contains at least one amino acid substitution, but not more than 50 
amino acid substitutions, even more preferably, not more than 40 amino acid 
5 substitutions, still more preferably, not more than 30 amino acid substitutions, and 
still even more preferably, not more than 20 amino acid substitutions. Of course, in 
order of ever-increasing preference, it is highly preferable for a polypeptide to have 
an amino acid sequence which comprises the amino acid sequence of the present 
invention, which contains at least one, but not more than 10, 9, 8, 7, 6, 5, 4, 3, 2 or 1 
10 amino acid substitutions. In specific embodiments, the number of additions, 
substitutions, and/or deletions in the amino acid sequence of the present invention or 
fragments thereof (e.g., the mature form and/or other fragments described herein), is 
1-5, 5-10, 5-25, 5-50, 10-50 or 50-150, conservative amino acid substitutions are 
preferable. 

15 

Polynucleotide and Polypeptide Fragments 

In the present invention, a "polynucleotide fragment" refers to a short 
polynucleotide having a nucleic acid sequence contained in the deposited clone or 
shown in SEQ ID NO:X. The short nucleotide fragments are preferably at least about 

20 15 nt, and more preferably at least about 20 nt, still more preferably at least about 30 
nt, and even more preferably, at least about 40 nt in length. A fragment "at least 20 nt 
in length," for example, is intended to include 20 or more contiguous bases from the 
cDNA sequence contained in the deposited clone or the nucleotide sequence shown in 
SEQ ID NO:X. These nucleotide fragments are useful as diagnostic probes and 

25 primers as discussed herein. Of course, larger fragments (e.g., 50, 150, 500, 600, 
2000 nucleotides) are preferred. 

Moreover, representative examples of polynucleotide fragments of the 
invention, include, for example, fragments having a sequence from about nucleotide 
number 1-50, 51-100, 101-150, 151-200, 201-250, 251-300, 301-350, 351-400, 401- 

30 450, 451-500, 501-550, 551-600, 651-700, 701-750, 751-800, 800-850, 851-900, 901- 
950,951-1000, 1001-1050, 1051-1100, 1101-1150, 1151-1200, 1201-1250, 1251- 
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1300, 1301-1350, 1351-1400, 1401-1450, 1451-1500, 1501-1550, 1551-1600, 1601- 
1650, 1651-1700, 1701-1750, 1751-1800, 1801-1850, 1851-1900, 1901-1950, 1951- 
2000, or 2001 to the end of SEQ ID NO:X or the cDNA contained in the deposited 
clone. In this context "about" includes the particularly recited ranges, larger or 
smaller by several (5, 4, 3, 2, or 1) nucleotides, at either terminus or at both termini. 
Preferably, these fragments encode a polypeptide which has biological activity. More 
preferably, these polynucleotides can be used as probes or primers as discussed 
herein. 

In the present invention, a "polypeptide fragment" refers to a short amino acid 
sequence contained in SEQ ID NO: Y or encoded by the cDNA contained in the 
deposited clone. Protein fragments may be "free-standing," or comprised within a 
larger polypeptide of which the fragment forms a part or region, most preferably as a 
single continuous region. Representative examples of polypeptide fragments of the 
invention, include, for example, fragments from about amino acid number 1-20, 21- 
40,41-60,61-80,81-100, 102-120, 121-140, 141-160, or 161 to the end of the coding 
region. Moreover, polypeptide fragments can be about 20, 30, 40, 50, 60, 70, 80, 90, 
100, 110, 120, 130, 140, or 150 amino acids in length. In this context "about" 
includes the particularly recited ranges, larger or smaller by several (5, 4, 3, 2, or 1) 
amino acids, at either extreme or at both extremes. 

Preferred polypeptide fragments include the secreted protein as well as the 
mature form. Further preferred polypeptide fragments include the secreted protein or 
the mature form having a continuous series of deleted residues from the amino or the 
carboxy terminus, or both. For example, any number of amino acids, ranging from 1- 
60, can be deleted from the amino terminus of either the secreted polypeptide or the 
mature form. Similarly, any number of amino acids, ranging from 1-30, can be 
deleted from the carboxy terminus of the secreted protein or mature form. 
Furthermore, any combination of the above amino and carboxy terminus deletions are 
preferred. Similarly, polynucleotide fragments encoding these polypeptide fragments 
are also preferred. 

Also preferred are polypeptide and polynucleotide fragments characterized by 
structural or functional domains, such as fragments that comprise alpha-helix and 
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alpha-helix forming regions, beta-sheet and beta-sheet-forming regions, turn and turn- 
forming regions, coil and coil-forming regions, hydrophilic regions, hydrophobic 
regions, alpha amphipathic regions, beta amphipathic regions, flexible regions, 
surface-forming regions, substrate binding region, and high antigenic index regions. 
5 Polypeptide fragments of SEQ ID NO: Y falling within conserved domains are 
specifically contemplated by the present invention. Moreover, polynucleotide 
fragments encoding these domains are also contemplated. 

Other preferred fragments are biologically active fragments. Biologically 
active fragments are those exhibiting activity similar, but not necessarily identical, to 
10 an activity of the polypeptide of the present invention. The biological activity of the 
fragments may include an improved desired activity, or a decreased undesirable 
activity. 

Epitopes & Antibodies 

15 In the present invention, "epitopes* 1 refer to polypeptide fragments having 

antigenic or immunogenic activity in an animal, especially in a human. A preferred 
embodiment of the present invention relates to a polypeptide fragment comprising an 
epitope, as well as the polynucleotide encoding this fragment. A region of a protein 
molecule to which an antibody can bind is defined as an "antigenic epitope." In 

20 contrast, an "immunogenic epitope" is defined as a part of a protein that elicits an 
antibody response. (See, for instance, Geysen et al., Proc. Natl. Acad. Sci. USA 
81:3998- 4002(1983).) 

Fragments which function as epitopes may be produced by any conventional 
means. (See, e.g., Houghten, R. A., Proc. Natl. Acad. Sci. USA 82:5131-5135 (1985) 

25 further described in U.S. Patent No. 4,631,21 1.) 

In the present invention, antigenic epitopes preferably contain a sequence of at 
least seven, more preferably at least nine, and most preferably between about 15 to 
about 30 amino acids. Antigenic epitopes are useful to raise antibodies, including 
monoclonal antibodies, that specifically bind the epitope. (See, for instance, Wilson 

30 et al., Cell 37:767-778 (1984); Sutcliffe, J. G. et al., Science 219:660-666 (1983).) 
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Similarly, immunogenic epitopes can be used to induce antibodies according 
to methods well known in the art. (See, for instance, Sutcliffe et al., supra; Wilson et 
al., supra; Chow, M. et al., Proc. Natl. Acad. Sci. USA 82:910-914; and Bittle, F. J. et 
al., J. Gen. Virol. 66:2347-2354 (1985).) A preferred immunogenic epitope includes 
5 the secreted protein. The immunogenic epitopes may be presented together with a 
carrier protein, such as an albumin, to an animal system (such as rabbit or mouse) or, 
if it is long enough (at least about 25 amino acids), without a carrier. However, 
immunogenic epitopes comprising as few as 8 to 10 amino acids have been shown to 
be sufficient to raise antibodies capable of binding to, at the very least, linear epitopes 

10 in a denatured polypeptide (e.g., in Western blotting.) 

As used herein, the term "antibody" (Ab) or "monoclonal antibody" (Mab) is 
meant to include intact molecules as well as antibody fragments (such as, for 
example, Fab and F(ab')2 fragments) which are capable of specifically binding to 
protein. Fab and F(ab')2 fragments lack the Fc fragment of intact antibody, clear 

15 more rapidly from the circulation, and may have less non-specific tissue binding than 
an intact antibody. (Wahl et al., J. Nucl. Med. 24:316-325 (1983).) Thus, these 
fragments are preferred, as well as the products of a FAB or other immunoglobulin 
expression library. Moreover, antibodies of the present invention include chimeric, 
single chain, and humanized antibodies. 

20 

Fusion Proteins 

Any polypeptide of the present invention can be used to generate fusion 
proteins. For example, the polypeptide of the present invention, when fused to a 
second protein, can be used as an antigenic tag. Antibodies raised against the 
25 polypeptide of the present invention can be used to indirectly detect the second 
protein by binding to the polypeptide. Moreover, because secreted proteins target 
cellular locations based on trafficking signals, the polypeptides of the present 
invention can be used as targeting molecules once fused to other proteins. 

Examples of domains that can be fused to polypeptides of the present 
30 invention include not only heterologous signal sequences, but also other heterologous 
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functional regions. The fusion does not necessarily need to be direct, but may occur 
through linker sequences. 

Moreover, fusion proteins may also be engineered to improve characteristics 
of the polypeptide of the present invention. For instance, a region of additional amino 
acids, particularly charged amino acids, may be added to the N-terminus of the 
polypeptide to improve stability and persistence during purification from the host cell 
or subsequent handling and storage. Also, peptide moieties may be added to the 
polypeptide to facilitate purification. Such regions may be removed prior to final 
preparation of the polypeptide. The addition of peptide moieties to facilitate handling 
of polypeptides are familiar and routine techniques in the art. 

Moreover, polypeptides of the present invention, including fragments, and 
specifically epitopes, can be combined with parts of the constant domain of 
immunoglobulins (IgG), resulting in chimeric polypeptides. These fusion proteins 
facilitate purification and show an increased half-life in vivo. One reported example 
describes chimeric proteins consisting of the first two domains of the human CD4- 
polypeptide and various domains of the constant regions of the heavy or light chains 
of mammalian immunoglobulins. (EP A 394,827; Traunecker et al., Nature 33 1 :84- 
86 (1988).) Fusion proteins having disulfide-linked dimeric structures (due to the 
IgG) can also be more efficient in binding and neutralizing other molecules, than the 
monomelic secreted protein or protein fragment alone. (Fountoulakis et al., J. 
Biochem. 270:3958-3964 (1995).) 

Similarly, EP-A-O 464 533 (Canadian counterpart 2045869) discloses fusion 
proteins comprising various portions of constant region of immunoglobulin molecules 
together with another human protein or part thereof. In many cases, the Fc part in a 
fusion protein is beneficial in therapy and diagnosis, and thus can result in, for 
example, improved pharmacokinetic properties. (EP-A 0232 262.) Alternatively, 
deleting the Fc part after the fusion protein has been expressed, detected, and purified, 
would be desired. For example, the Fc portion may hinder therapy and diagnosis if 
the fusion protein is used as an antigen for immunizations. In drug discovery, for 
example, human proteins, such as hIL-5, have been fused with Fc portions for the 
purpose of high-throughput screening assays to identify antagonists of hIL-5. (See, 
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D. Bennett et aL, J. Molecular Recognition 8:52-58 (1995); K. Johanson et aL, J. Biol. 
Chem. 270:9459-9471 (1995).) 

Moreover, the polypeptides of the present invention can be fused to marker 
sequences, such as a peptide which facilitates purification of the fused polypeptide. 
5 In preferred embodiments, the marker amino acid sequence is a hexa-histidine 

peptide, such as the tag provided in a pQE vector (QIAGEN, Inc., 9259 Eton Avenue, 
Chatsworth, CA, 91311), among others, many of which are commercially available. 
As described in Gentz et aL, Proc. Natl. Acad. Sci. USA 86:821-824 (1989), for 
instance, hexa-histidine provides for convenient purification of the fusion protein. 
10 Another peptide tag useful for purification, the "HA" tag, corresponds to an epitope 
derived from the influenza hemagglutinin protein. (Wilson et al., Cell 37:767 
(1984).) 

Thus, any of these above fusions can be engineered using the polynucleotides 
or the polypeptides of the present invention. 

15 

Vectors, Host Cells, and Protein Production 

The present invention also relates to vectors containing the polynucleotide of 
the present invention, host cells, and the production of polypeptides by recombinant 
techniques. The vector may be, for example, a phage, plasmid, viral, or retroviral 
20 vector. Retroviral vectors may be replication competent or replication defective. In 
the latter case, viral propagation generally will occur only in complementing host 
cells. 

The polynucleotides may be joined to a vector containing a selectable marker 
for propagation in a host. Generally, a plasmid vector is introduced in a precipitate, 
25 such as a calcium phosphate precipitate, or in a complex with a charged lipid. If the 
vector is a virus, it may be packaged in vitro using an appropriate packaging cell line 
and then transduced into host cells. 

The polynucleotide insert should be operatively linked to an appropriate 
promoter, such as the phage lambda PL promoter, the E. coli lac, trp, phoA and tac 
30 promoters, the SV40 early and late promoters and promoters of retroviral LTRs, to * 
name a few. Other suitable promoters will be known to the skilled artisan. The 
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expression constructs will further contain sites for transcription initiation, termination, 
and, in the transcribed region, a ribosome binding site for translation. The coding 
portion of the transcripts expressed by the constructs will preferably include a 
translation initiating codon at the beginning and a termination codon (UAA, UGA or 
UAG) appropriately positioned at the end of the polypeptide to be translated. 

As indicated, the expression vectors will preferably include at least one 
selectable marker. Such markers include dihydrofolate reductase, G418 or neomycin 
resistance for eukaryotic cell culture and tetracycline, kanamycin or ampicillin 
resistance genes for culturing in E. coli and other bacteria. Representative examples 
of appropriate hosts include, but are not limited to, bacterial cells, such as E. coli, 
Streptomyces and Salmonella typhimurium cells; fungal cells, such as yeast cells; 
insect cells such as Drosophila S2 and Spodoptera Sf9 cells; animal cells such as 
CHO, COS, 293, and Bowes melanoma cells; and plant cells. Appropriate culture 
mediums and conditions for the above-described host cells are known in the art. 

Among vectors preferred for use in bacteria include pQE70, pQE60 and pQE- 
9, available from QIAGEN, Inc.; pBluescript vectors, Phagescript vectors, pNH8A, 
pNH16a, pNH18A, pNH46A, available from Stratagene Cloning Systems, Inc.; and 
ptrc99a, pKK223-3, pKK233-3, pDR540, pRIT5 available from Pharmacia Biotech, 
Inc. Among preferred eukaryotic vectors are pWLNEO, pSV2CAT, pOG44, pXTl 
and pSG available from Stratagene; and pSVK3, pBPV, pMSG and pSVL available 
from Pharmacia. Other suitable vectors will be readily apparent to the skilled artisan. 

Introduction of the construct into the host cell can be effected by calcium 
phosphate transfection, DEAE-dextran mediated transfection, cationic lipid-mediated 
transfection, electroporation, transduction, infection, or other methods. Such methods 
are described in many standard laboratory manuals, such as Davis et al., Basic 
Methods In Molecular Biology (1986). It is specifically contemplated that the 
polypeptides of the present invention may in fact be expressed by a host cell lacking a 
recombinant vector. 

A polypeptide of this invention can be recovered and purified from 
recombinant cell cultures by well-known methods including ammonium sulfate or 
ethanol precipitation, acid extraction, anion or cation exchange chromatography, 
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phosphocellulose chromatography, hydrophobic interaction chromatography, affinity 
chromatography, hydroxylapatite chromatography and lectin chromatography. Most 
preferably, high performance liquid chromatography ("HPLC") is employed for 
purification. 

5 Polypeptides of the present invention, and preferably the secreted form, can 

also be recovered from: products purified from natural sources, including bodily 
fluids, tissues and cells, whether directly isolated or cultured; products of chemical 
synthetic procedures; and products produced by recombinant techniques from a 
prokaryotic or eukaryotic host, including, for example, bacterial, yeast, higher plant, 

10 insect, and mammalian cells. Depending upon the host employed in a recombinant 
production procedure, the polypeptides of the present invention may be glycosylated 
or may be non-glycosylated. In addition, polypeptides of the invention may also 
include an initial modified methionine residue, in some cases as a result of host- 
mediated processes. Thus, it is well known in the art that the N-terminal methionine 

15 encoded by the translation initiation codon generally is removed with high efficiency 
from any protein after translation in all eukaryotic cells. While the N-terminal 
methionine on most proteins also is efficiently removed in most prokaryotes, for some 
proteins, this prokaryotic removal process is inefficient, depending on the nature of 
the amino acid to which the N-terminal methionine is covalently linked. 

20 In addition to encompassing host cells containing the vector constructs 

discussed herein, the invention also encompasses primary, secondary, and 
immortalized host cells of vertebrate origin, particularly mammalian origin, that have 
been engineered to delete or replace endogenous genetic material (e.g., coding 
sequence), and/or to include genetic material (e.g., heterologous polynucleotide 

25 sequences) that is operably associated with the polynucleotides of the invention, and 
which activates, alters, and/or amplifies endogenous polynucleotides. For example, 
techniques known in the art may be used to operably associate heterologous control 
regions (e.g., promoter and/or enhancer) and endogenous polynucleotide sequences 
via homologous recombination (see, e.g., U.S. Patent No. 5,641,670, issued June 24, 

30 1997; International Publication No. WO 96/29411, published September 26, 1996; 
International Publication No. WO 94/12650, published August 4, 1994; Koller et al., 
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Proc. Natl. Acad. Sci. USA 86:8932-8935 (1989); and Zijlstra et al., Nature 342:435- 
438 (1989), the disclosures of each of which are incorporated by reference in their 
entireties). 

Uses of the Polynucleotides 

Each of the polynucleotides identified herein can be used in numerous ways as 
reagents. The following description should be considered exemplary and utilizes 
known techniques. 

The polynucleotides of the present invention are useful for chromosome 
identification. There exists an ongoing need to identify new chromosome markers, 
since few chromosome marking reagents, based on actual sequence data (repeat 
polymorphisms), are presently available. Each polynucleotide of the present 
invention can be used as a chromosome marker. 

Briefly, sequences can be mapped to chromosomes by preparing PCR primers 
(preferably 15-25 bp) from the sequences shown in SEQ ID NO:X. Primers can be 
selected using computer analysis so that primers do not span more than one predicted 
exon in the genomic DNA. These primers are then used for PCR screening of 
somatic cell hybrids containing individual human chromosomes. Only those hybrids 
containing the human gene corresponding to the SEQ ID NO:X will yield an 
amplified fragment. 

Similarly, somatic hybrids provide a rapid method of PCR mapping the 
polynucleotides to particular chromosomes. Three or more clones can be assigned per 
day using a single thermal cycler. Moreover, sublocalization of the polynucleotides 
can be achieved with panels of specific chromosome fragments. Other gene mapping 
strategies that can be used include in situ hybridization, prescreening with labeled 
flow-sorted chromosomes, and preselection by hybridization to construct 
chromosome specific-cDNA libraries. 

Precise chromosomal location of the polynucleotides can also be achieved 
using fluorescence in situ hybridization (FISH) of a metaphase chromosomal spread. 
This technique uses polynucleotides as short as 500 or 600 bases; however, 
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polynucleotides 2,000-4,000 bp are preferred. For a review of this technique, see 
Verma et al., "Human Chromosomes: a Manual of Basic Techniques," Pergamon 
Press, New York (1988). 

For chromosome mapping, the polynucleotides can be used individually (to 
5 mark a single chromosome or a single site on that chromosome) or in panels (for 
marking multiple sites and/or multiple chromosomes). Preferred polynucleotides 
correspond to the noncoding regions of the cDNAs because the coding sequences are 
more likely conserved within gene families, thus increasing the chance of cross 
hybridization during chromosomal mapping. 

10 Once a polynucleotide has been mapped to a precise chromosomal location, 

the physical position of the polynucleotide can be used in linkage analysis. Linkage 
analysis establishes coinheritance between a chromosomal location and presentation 
of a particular disease. (Disease mapping data are found, for example, in V. 
McKusick, Mendelian Inheritance in Man (available on line through Johns Hopkins 

15 University Welch Medical Library) .) Assuming 1 megabase mapping resolution and 
one gene per 20 kb, a cDNA precisely localized to a chromosomal region associated 
with the disease could be one of 50-500 potential causative genes. 

Thus, once coinheritance is established, differences in the polynucleotide and 
the corresponding gene between affected and unaffected individuals can be examined. 

20 First, visible structural alterations in the chromosomes, such as deletions or 

translocations, are examined in chromosome spreads or by PCR. If no structural 
alterations exist, the presence of point mutations are ascertained. Mutations observed 
in some or all affected individuals, but not in normal individuals, indicates that the 
mutation may cause the disease. However, complete sequencing of the polypeptide 

25 and the corresponding gene from several normal individuals is required to distinguish 
the mutation from a polymorphism. If a new polymorphism is identified, this 
polymorphic polypeptide can be used for further linkage analysis. 

Furthermore, increased or decreased expression of the gene in affected 
individuals as compared to unaffected individuals can be assessed using 

30 polynucleotides of the present invention. Any of these alterations (altered expression, 
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chromosomal rearrangement, or mutation) can be used as a diagnostic or prognostic 
marker. 

In addition to the foregoing, a polynucleotide can be used to control gene 
expression through triple helix formation or antisense DNA or RNA. Both methods 
5 rely on binding of the polynucleotide to DNA or RNA. For these techniques, 

preferred polynucleotides are usually 20 to 40 bases in length and complementary to 
either the region of the gene involved in transcription (triple helix - see Lee et al., 
Nucl. Acids Res. 6:3073 (1979); Cooney et al., Science 241:456 (1988); and Dervan 
et ah, Science 251: 1360 (1991) ) or to the mRNA itself (antisense - Okano, J. 
1 0 Neurochem. 56:560 ( 1 99 1 ); Oligodeoxy-nucleotides as Antisense Inhibitors of Gene 
Expression, CRC Press, Boca Raton, FL (1988).) Triple helix formation optimally 
results in a shut-off of RNA transcription from DNA, while antisense RNA 
hybridization blocks translation of an mRNA molecule into polypeptide. Both 
techniques are effective in model systems, and the information disclosed herein can 
15 be used to design antisense or triple helix polynucleotides in an effort to treat disease. 

Polynucleotides of the present invention are also useful in gene therapy. One 
goal of gene therapy is to insert a normal gene into an organism having a defective 
gene, in an effort to correct the genetic defect. The polynucleotides disclosed in the 
present invention offer a means of targeting such genetic defects in a highly accurate 
20 manner. Another goal is to insert a new gene that was not present in the host genome, 
thereby producing a new trait in the host cell. 

The polynucleotides are also useful for identifying individuals from minute 
biological samples. The United States military, for example, is considering the use of 
restriction fragment length polymorphism (RFLP) for identification of its personnel. 
25 In this technique, an individual's genomic DNA is digested with one or more 
restriction enzymes, and probed on a Southern blot to yield unique bands for 
identifying personnel. This method does not suffer from the current limitations of 
"Dog Tags n which can be lost, switched, or stolen, making positive identification 
difficult. The polynucleotides of the present invention can be used as additional DNA 
30 markers for RFLP. 
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The polynucleotides of the present invention can also be used as an alternative 
to RFLP, by determining the actual base-by-base DNA sequence of selected portions 
of an individual's genome. These sequences can be used to prepare PCR primers for 
amplifying and isolating such selected DNA, which can then be sequenced. Using 
5 this technique, individuals can be identified because each individual will have a 
unique set of DNA sequences. Once an unique ID database is established for an 
individual, positive identification of that individual, living or dead, can be made from 
extremely small tissue samples. 

Forensic biology also benefits from using DNA-based identification 

10 techniques as disclosed herein. DNA sequences taken from very small biological 
samples such as tissues, e.g., hair or skin, or body fluids, e.g., blood, saliva, semen, 
etc., can be amplified using PCR. In one prior art technique, gene sequences 
amplified from polymorphic loci, such as DQa class II HLA gene, are used in forensic 
biology to identify individuals. (Erlich, H., PCR Technology, Freeman and Co. 

15 (1992).) Once these specific polymorphic loci are amplified, they are digested with 
one or more restriction enzymes, yielding an identifying set of bands on a Southern 
blot probed with DNA corresponding to the DQa class II HLA gene. Similarly, 
polynucleotides of the present invention can be used as polymorphic markers for 
forensic purposes. 

20 There is also a need for reagents capable of identifying the source of a 

particular tissue. Such need arises, for example, in forensics when presented with 
tissue of unknown origin. Appropriate reagents can comprise, for example, DNA 
probes or primers specific to particular tissue prepared from the sequences of the 
present invention. Panels of such reagents can identify tissue by species and/or by 

25 organ type. In a similar fashion, these reagents can be used to screen tissue cultures 
for contamination. 

In the very least, the polynucleotides of the present invention can be used as 
molecular weight markers on Southern gels, as diagnostic probes for the presence of a 
specific mRNA in a particular cell type, as a probe to "subtract-out" known sequences 
30 in the process of discovering novel polynucleotides, for selecting and making 
oligomers for attachment to a "gene chip" or other support, to raise anti-DNA 
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antibodies using DNA immunization techniques, and as an antigen to elicit an 
immune response. 

Uses of the Polypeptides 

Each of the polypeptides identified herein can be used in numerous ways. The 
following description should be considered exemplary and utilizes known techniques. 

A polypeptide of the present invention can be used to assay protein levels in a 
biological sample using antibody-based techniques. For example, protein expression 
in tissues can be studied with classical immunohistological methods. (Jalkanen, M., 
et al„ J. Cell. Biol. 101:976-985 (1985); Jalkanen, M., et al., J. Cell . Biol. 105:3087- 
3096 (1987).) Other antibody-based methods useful for detecting protein gene 
expression include immunoassays, such as the enzyme linked immunosorbent assay 
(ELISA) and the radioimmunoassay (RIA). Suitable antibody assay labels are known 
in the art and include enzyme labels, such as, glucose oxidase, and radioisotopes, such 
as iodine (1251, 1211), carbon (14C), sulfur (35S), tritium (3H), indium (1 12In), and 
technetium (99mTc), and fluorescent labels, such as fluorescein and rhodamine, and 
biotin. 

In addition to assaying secreted protein levels in a biological sample, proteins 
can also be detected in vivo by imaging. Antibody labels or markers for in vivo 
imaging of protein include those detectable by X-radiography, NMR or ESR. For X- 
radiography, suitable labels include radioisotopes such as barium or cesium, which 
emit detectable radiation but are not overtly harmful to the subject. Suitable markers 
for NMR and ESR include those with a detectable characteristic spin, such as 
deuterium, which may be incorporated into the antibody by labeling of nutrients for 
the relevant hybridoma. 

A protein-specific antibody or antibody fragment which has been labeled with 
an appropriate detectable imaging moiety, such as a radioisotope (for example, 1311, 
1 12In, 99mTc), a radio-opaque substance, or a material detectable by nuclear 
magnetic resonance, is introduced (for example, parenterally, subcutaneously, or 
intraperitoneally) into the mammal. It will be understood in the art that the size of the 
subject and the imaging system used will determine the quantity of imaging moiety 
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needed to produce diagnostic images. In the case of a radioisotope moiety, for a 
human subject, the quantity of radioactivity injected will normally range from about 5 
to 20 millicuries of 99mTc. The labeled antibody or antibody fragment will then 
preferentially accumulate at the location of cells which contain the specific protein. 
5 In vivo tumor imaging is described in S.W. Burchiel et al M "Immunopharmacokinetics 
of Radiolabeled Antibodies and Their Fragments." (Chapter 13 in Tumor Imaging: 
The Radiochemical Detection of Cancer, S.W. Burchiel and B. A. Rhodes, eds., 
Masson Publishing Inc. (1982).) 

Thus, the invention provides a diagnostic method of a disorder, which 

10 involves (a) assaying the expression of a polypeptide of the present invention in cells 
or body fluid of an individual; (b) comparing the level of gene expression with a 
standard gene expression level, whereby an increase or decrease in the assayed 
polypeptide gene expression level compared to the standard expression level is 
indicative of a disorder. 

15 Moreover, polypeptides of the present invention can be used to treat disease. 

For example, patients can be administered a polypeptide of the present invention in an 
effort to replace absent or decreased levels of the polypeptide (e.g., insulin), to 
supplement absent or decreased levels of a different polypeptide (e.g., hemoglobin S 
for hemoglobin B), to inhibit the activity of a polypeptide (e.g., an oncogene), to 

20 activate the activity of a polypeptide (e.g., by binding to a receptor), to reduce the 
activity of a membrane bound receptor by competing with it for free ligand (e.g., 
soluble TNF receptors used in reducing inflammation), or to bring about a desired 
response (e.g., blood vessel growth). 

Similarly, antibodies directed to a polypeptide of the present invention can 

25 also be used to treat disease. For example, administration of an antibody directed to a 
polypeptide of the present invention can bind and reduce overproduction of the 
polypeptide. Similarly, administration of an antibody can activate the polypeptide, 
such as by binding to a polypeptide bound to a membrane (receptor). 

At the very least, the polypeptides of the present invention can be used as 

30 molecular weight markers on SDS-PAGE gels or on molecular sieve gel filtration 

columns using methods well known to those of skill in the art. Polypeptides can also 
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be used to raise antibodies, which in turn are used to measure protein expression from 
a recombinant cell, as a way of assessing transformation of the host cell. Moreover, 
the polypeptides of the present invention can be used to test the following biological 
activities. 

Biological Activities 

The polynucleotides and polypeptides of the present invention can be used in 
assays to test for one or more biological activities. If these polynucleotides and 
polypeptides do exhibit activity in a particular assay, it is likely that these molecules 
may be involved in the diseases associated with the biological activity. Thus, the 
polynucleotides and polypeptides could be used to treat the associated disease. 

Immune Activity 

A polypeptide or polynucleotide of the present invention may be useful in 
treating deficiencies or disorders of the immune system, by activating or inhibiting the 
proliferation, differentiation, or mobilization (chemotaxis) of immune cells. Immune 
cells develop through a process called hematopoiesis, producing myeloid (platelets, 
red blood cells, neutrophils, and macrophages) and lymphoid (B and T lymphocytes) 
cells from pluripotent stem cells. The etiology of these immune deficiencies or 
disorders may be genetic, somatic, such as cancer or some autoimmune disorders, 
acquired (e.g., by chemotherapy or toxins), or infectious. Moreover, a polynucleotide 
or polypeptide of the present invention can be used as a marker or detector of a 
particular immune system disease or disorder. 

A polynucleotide or polypeptide of the present invention may be useful in 
treating or detecting deficiencies or disorders of hematopoietic cells. A 
polypeptide or polynucleotide of the present invention could be used to increase 
differentiation and proliferation of hematopoietic cells, including the pluripotent stem 
cells, in an effort to treat those disorders associated with a decrease in certain (or 
many) types hematopoietic cells. Examples of immunologic deficiency syndromes 
include, but are not limited to: blood protein disorders (e.g. agammaglobulinemia, 
dysgammaglobulinemia), ataxia telangiectasia, common variable immunodeficiency, 
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Digeorge Syndrome, HIV infection, HTLV-BLV infection, leukocyte adhesion 
deficiency syndrome, lymphopenia, phagocyte bactericidal dysfunction, severe 
combined immunodeficiency (SCIDs), Wiskott-Aldrich Disorder, anemia, 
thrombocytopenia, or hemoglobinuria. 
5 Moreover, a polypeptide or polynucleotide of the present invention could also 

be used to modulate hemostatic (the stopping of bleeding) or thrombolytic activity 
(clot formation). For example, by increasing hemostatic or thrombolytic activity, a 
polynucleotide or polypeptide of the present invention could be used to treat blood 
coagulation disorders (e.g., afibrinogenemia, factor deficiencies), blood platelet 

10 disorders (e.g. thrombocytopenia), or wounds resulting from trauma, surgery, or other 
causes. Alternatively, a polynucleotide or polypeptide of the present invention that 
can decrease hemostatic or thrombolytic activity could be used to inhibit or dissolve 
clotting. These molecules could be important in the treatment of heart attacks 
(infarction), strokes, or scarring. 

15 A polynucleotide or polypeptide of the present invention may also be useful in 

treating or detecting autoimmune disorders. Many autoimmune disorders result from 
inappropriate recognition of self as foreign material by immune cells. This 
inappropriate recognition results in an immune response leading to the destruction of 
the host tissue. Therefore, the administration of a polypeptide or polynucleotide of the 

20 present invention that inhibits an immune response, particularly the proliferation, 
differentiation, or chemotaxis of T-cells, may be an effective therapy in preventing 
autoimmune disorders. 

Examples of autoimmune disorders that can be treated or detected by the 
present invention include, but are not limited to: Addison's Disease, hemolytic 

25 anemia, antiphospholipid syndrome, rheumatoid arthritis, dermatitis, allergic 

encephalomyelitis, glomerulonephritis, Goodpasture's Syndrome, Graves' Disease, 
Multiple Sclerosis, Myasthenia Gravis, Neuritis, Ophthalmia, Bullous Pemphigoid, 
Pemphigus, Polyendocrinopathies, Purpura, Reiter's Disease, Stiff-Man Syndrome, 
Autoimmune Thyroiditis, Systemic Lupus Erythematosus, Autoimmune Pulmonary 

30 Inflammation, Guillain-Barre Syndrome, insulin dependent diabetes mellitis, and 
autoimmune inflammatory eye disease. 
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Similarly, allergic reactions and conditions, such as asthma (particularly 
allergic asthma) or other respiratory problems, may also be treated by a polypeptide 
or polynucleotide of the present invention. Moreover, these molecules can be used to 
treat anaphylaxis, hypersensitivity to an antigenic molecule, or blood group 
5 incompatibility. 

A polynucleotide or polypeptide of the present invention may also be used to 
treat and/or prevent organ rejection or graft-versus-host disease (GVHD). Organ 
rejection occurs by host immune cell destruction of the transplanted tissue through an 
immune response. Similarly, an immune response is also involved in GVHD, but, in 
10 this case, the foreign transplanted immune cells destroy the host tissues. The 

administration of a polypeptide or polynucleotide of the present invention that inhibits 
an immune response, particularly the proliferation, differentiation, or chemotaxis of 
T-cells, may be an effective therapy in preventing organ rejection or GVHD. 

Similarly, a polypeptide or polynucleotide of the present invention may also 
1 5 be used to modulate inflammation. For example, the polypeptide or polynucleotide 
may inhibit the proliferation and differentiation of cells involved in an inflammatory 
response. These molecules can be used to treat inflammatory conditions, both chronic 
and acute conditions, including inflammation associated with infection (e.g., septic 
shock, sepsis, or systemic inflammatory response syndrome (SIRS)), ischemia- 
20 reperfusion injury, endotoxin lethality, arthritis, complement-mediated hyperacute 
rejection, nephritis, cytokine or chemokine induced lung injury, inflammatory bowel 
disease, Crohn's disease, or resulting from over production of cytokines (e.g., TNF or 
IL-1.) 

25 Hyperprol iferative HknrHere 

A polypeptide^ or polynucleotide can be used to treat or detect 
hyperproliferative disorders, including neoplasms. A polypeptide or polynucleotide 
of the present invention may inhibit the proliferation of the disorder through direct or 
indirect interactions. Alternatively, a polypeptide or polynucleotide of the present 
30 invention may proliferate other cells which can inhibit the hyperproliferative disorder. 
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For example, by increasing an immune response, particularly increasing 
antigenic qualities of the hyperproliferative disorder or by proliferating, 
differentiating, or mobilizing T-cells, hyperproliferative disorders can be treated. 
This immune response may be increased by either enhancing an existing immune 
5 response, or by initiating a new immune response. Alternatively, decreasing an 

immune response may also be a method of treating hyperproliferative disorders, such 
as a chemotherapeutic agent. 

Examples of hyperproliferative disorders that can be treated or detected by a 
polynucleotide or polypeptide of the present invention include, but are not limited to 
10 neoplasms located in the: abdomen, bone, breast, digestive system, liver, pancreas, 

peritoneum, endocrine glands (adrenal, parathyroid, pituitary, testicles, ovary, thymus, 
thyroid), eye, head and neck, nervous (central and peripheral), lymphatic system, 
pelvic, skin, soft tissue, spleen, thoracic, and urogenital. 

Similarly, other hyperproliferative disorders can also be treated or detected by 
15 a polynucleotide or polypeptide of the present invention. Examples of such 
hyperproliferative disorders include, but are not limited to: 

hypergammaglobulinemia, lymphoproliferative disorders, paraproteinemias, purpura, 
sarcoidosis, Sezary Syndrome, Waldenstron's Macroglobulinemia, Gaucher' s 
Disease, histiocytosis, and any other hyperproliferative disease, besides neoplasia, 
20 located in an organ system listed above. 

Infectious Disease 

A polypeptide or polynucleotide of the present invention can be used to treat 
or detect infectious agents. For example, by increasing the immune response, 

25 particularly increasing the proliferation and differentiation of B and/or T cells, 

infectious diseases may be treated. The immune response may be increased by either 
enhancing an existing immune response, or by initiating a new immune response. 
Alternatively, the polypeptide or polynucleotide of the present invention may also 
directly inhibit the infectious agent, without necessarily eliciting an immune response. 

30 Viruses are one example of an infectious agent that can cause disease or 

symptoms that can be treated or detected by a polynucleotide or polypeptide of the 
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present invention. Examples of viruses, include, but are not limited to the following 
DNA and RNA viral families: Arbovirus, Adenoviridae, Arenaviridae, Arterivirus, 
Birnaviridae, Bunyaviridae, Caliciviridae, Circoviridae, Coronaviridae, Flaviviridae, 
Hepadnaviridae (Hepatitis), Herpesviridae (such as, Cytomegalovirus, Herpes 
5 Simplex, Herpes Zoster), Mononegavirus (e.g., Paramyxoviridae, Morbillivirus, 
Rhabdoviridae), Orthomyxoviridae (e.g., Influenza), Papovaviridae, Parvoviridae, 
Picornaviridae, Poxviridae (such as Smallpox or Vaccinia), Reoviridae (e.g., 
Rotavirus), Retroviridae (HTLV-I, HTLV-II, Lentivirus), and Togaviridae (e.g., 
Rubivirus). Viruses falling within these families can cause a variety of diseases or 
10 symptoms, including, but not limited to: arthritis, bronchiolitis, encephalitis, eye 

infections (e.g., conjunctivitis, keratitis), chronic fatigue syndrome, hepatitis (A, B, C, 
E, Chronic Active, Delta), meningitis, opportunistic infections (e.g., AIDS), 
pneumonia, Burkitt's Lymphoma, chickenpox , hemorrhagic fever, Measles, Mumps, 
Parainfluenza, Rabies, the common cold, Polio, leukemia, Rubella, sexually 
15 transmitted diseases, skin diseases (e.g., Kaposi's, warts), and viremia. A polypeptide 
or polynucleotide of the present invention can be used to treat or detect any of these 
symptoms or diseases. 

Similarly, bacterial or fungal agents that can cause disease or symptoms and 
that can be treated or detected by a polynucleotide or polypeptide of the present 
20 invention include, but not limited to, the following Gram-Negative and Gram-positive 
bacterial families and fungi: Actinomycetales (e.g., Corynebacterium, 
Mycobacterium, Norcardia), Aspergillosis, Bacillaceae (e.g., Anthrax, Clostridium), 
Bacteroidaceae, Blastomycosis, Bordetella, Borrelia, Brucellosis, Candidiasis, 
Campylobacter, Coccidioidomycosis, Cryptococcosis, Dermatocycoses, 
25 Enterobacteriaceae (Klebsiella, Salmonella, Serratia, Yersinia), Erysipelothrix, 

Helicobacter, Legionellosis, Leptospirosis, Listeria, Mycoplasmatales, Neisseriaceae 
(e.g., Acinetobacter, Gonorrhea, Menigococcal), Pasteurellacea Infections (e.g., 
Actinobacillus, Heamophilus, Pasteurella), Pseudomonas, Rickettsiaceae, 
Chlamydiaceae, Syphilis, and Staphylococcal. These bacterial or fungal families can 
30 cause the following diseases or symptoms, including, but not limited to: bacteremia, 
endocarditis, eye infections (conjunctivitis, tuberculosis, uveitis), gingivitis, 
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opportunistic infections (e.g., AIDS related infections), paronychia, prosthesis-related 
infections, Reiter's Disease, respiratory tract infections, such as Whooping Cough or 
Empyema, sepsis, Lyme Disease, Cat-Scratch Disease, Dysentery, Paratyphoid Fever, 
food poisoning, Typhoid, pneumonia, Gonorrhea, meningitis, Chlamydia, Syphilis, 
5 Diphtheria, Leprosy, Paratuberculosis, Tuberculosis, Lupus, Botulism, gangrene, 

tetanus, impetigo, Rheumatic Fever, Scarlet Fever, sexually transmitted diseases, skin 
diseases (e.g., cellulitis, dermatocycoses), toxemia, urinary tract infections, wound 
infections. A polypeptide or polynucleotide of the present invention can be used to 
treat or detect any of these symptoms or diseases. 

10 Moreover, parasitic agents causing disease or symptoms that can be treated or 

detected by a polynucleotide or polypeptide of the present invention include, but not 
limited to, the following families: Amebiasis, Babesiosis, Coccidiosis, 
Cryptosporidiosis, Dientamoebiasis, Dourine, Ectoparasitic, Giardiasis, 
Helminthiasis, Leishmaniasis, Theileriasis, Toxoplasmosis, Trypanosomiasis, and 

15 Trichomonas. These parasites can cause a variety of diseases or symptoms, including, 
but not limited to: Scabies, Trombiculiasis, eye infections, intestinal disease (e.g., 
dysentery, giardiasis), liver disease, lung disease, opportunistic infections (e.g., AIDS 
related), Malaria, pregnancy complications, and toxoplasmosis. A polypeptide or 
polynucleotide of the present invention can be used to treat or detect any of these 

20 symptoms or diseases. 

Preferably, treatment using a polypeptide or polynucleotide of the present 
invention could either be by administering an effective amount of a polypeptide to the 
patient, or by removing cells from the patient, supplying the cells with a 
polynucleotide of the present invention, and returning the engineered cells to the 

25 patient (ex vivo therapy). Moreover, the polypeptide or polynucleotide of the present 
invention can be used as an antigen in a vaccine to raise an immune response against 
infectious disease. 

Re generation 

30 A polynucleotide or polypeptide of the present invention can be used to 

differentiate, proliferate, and attract cells, leading to the regeneration of tissues. (See, 
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Science 276:59-87 (1997).) The regeneration of tissues could be used to repair, 
replace, or protect tissue damaged by congenital defects, trauma (wounds, bums, 
incisions, or ulcers), age, disease (e.g. osteoporosis, osteocarthritis, periodontal 
disease, liver failure), surgery, including cosmetic plastic surgery, fibrosis, 
5 reperfusion injury, or systemic cytokine damage. 

Tissues that could be regenerated using the present invention include organs 
(e.g., pancreas, liver, intestine, kidney, skin, endothelium), muscle (smooth, skeletal 
or cardiac), vasculature (including vascular and lymphatics), nervous, hematopoietic, 
and skeletal (bone, cartilage, tendon, and ligament) tissue. Preferably, regeneration- 
10 occurs without or decreased scarring. Regeneration also may include angiogenesis. 

Moreover, a polynucleotide or polypeptide of the present invention may 
increase regeneration of tissues difficult to heal. For example, increased 
tendon/ligament regeneration would quicken recovery time after damage. A 
polynucleotide or polypeptide of the present invention could also be used 
15 prophylactically in an effort to avoid damage. Specific diseases that could be treated 
include of tendinitis, carpal tunnel syndrome, and other tendon or ligament defects. A 
further example of tissue regeneration of non-healing wounds includes pressure 
ulcers, ulcers associated with vascular insufficiency, surgical, and traumatic wounds. 
Similarly, nerve and brain tissue could also be regenerated by using a 
20 polynucleotide or polypeptide of the present invention to proliferate and differentiate 
nerve cells. Diseases that could be treated using this method include central and 
peripheral nervous system diseases, neuropathies, or mechanical and traumatic 
disorders (e.g., spinal cord disorders, head trauma, cerebrovascular disease, and 
stoke). Specifically, diseases associated with peripheral nerve injuries, peripheral 
25 neuropathy (e.g., resulting from chemotherapy or other medical therapies), localized 
neuropathies, and central nervous system diseases (e.g., Alzheimer's disease, 
Parkinson's disease, Huntington's disease, amyotrophic lateral sclerosis, and Shy- 
Drager syndrome), could all be treated using the polynucleotide or polypeptide of the 
present invention. 

30 

Chemotaxis 
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A polynucleotide or polypeptide of the present invention may have 
chemotaxis activity. A chemotaxic molecule attracts or mobilizes cells (e.g., 
monocytes, fibroblasts, neutrophils, T-cells, mast cells, eosinophils, epithelial and/or 
endothelial cells) to a particular site in the body, such as inflammation, infection, or 
5 site of hyperproliferation. The mobilized cells can then fight off and/or heal the 
particular trauma or abnormality. 

A polynucleotide or polypeptide of the present invention may increase 
chemotaxic activity of particular cells. These chemotactic molecules can then be used 
to treat inflammation, infection, hyperproliferative disorders, or any immune system 
10 disorder by increasing the number of cells targeted to a particular location in the body. 
For example, chemotaxic molecules can be used to treat wounds and other trauma to 
tissues by attracting immune cells to the injured location. Chemotactic molecules of 
the present invention can also attract fibroblasts, which can be used to treat wounds. 

It is also contemplated that a polynucleotide or polypeptide of the present 
15 invention may inhibit chemotactic activity. These molecules could also be used to 
treat disorders. Thus, a polynucleotide or polypeptide of the present invention could 
be used as an inhibitor of chemotaxis. 

Binding Activity 

20 A polypeptide of the present invention may be used to screen for molecules 

that bind to the polypeptide or for molecules to which the polypeptide binds. The 
binding of the polypeptide and the molecule may activate (agonist), increase, inhibit 
(antagonist), or decrease activity of the polypeptide or the molecule bound. Examples 
of such molecules include antibodies, oligonucleotides, proteins (e.g., receptors),or 

25 small molecules. 

Preferably, the molecule is closely related to the natural ligand of the 
polypeptide, e.g., a fragment of the ligand, or a natural substrate, a ligand, a structural 
or functional mimetic. (See, Coligan et al., Current Protocols in Immunology 
l(2):Chapter 5 (1991).) Similarly, the molecule can be closely related to the natural 

30 receptor to which the polypeptide binds, or at least, a fragment of the receptor capable 
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of being bound by the polypeptide (e.g., active site). In either case, the molecule can 
be rationally designed using known techniques. 

Preferably, the screening for these molecules involves producing appropriate 
cells which express the polypeptide, either as a secreted protein or on the cell 
membrane. Preferred cells include cells from mammals, yeast, Drosophila, or E. coli. 
Cells expressing the polypeptide (or cell membrane containing the expressed 
polypeptide) are then preferably contacted with a test compound potentially 
containing the molecule to observe binding, stimulation, or inhibition of activity of 
either the polypeptide or the molecule. 

The assay may simply test binding of a candidate compound to the 
polypeptide, wherein binding is detected by a label, or in an assay involving 
competition with a labeled competitor. Further, the assay may test whether the 
candidate compound results in a signal generated by binding to the polypeptide. 

Alternatively, the assay can be carried out using cell-free preparations, 
polypeptide/molecule affixed to a solid support, chemical libraries, or natural product 
mixtures. The assay may also simply comprise the steps of mixing a candidate 
compound with a solution containing a polypeptide, measuring polypeptide/molecule 
activity or binding, and comparing the polypeptide/molecule activity or binding to a 
standard. 

Preferably, an ELISA assay can measure polypeptide level or activity in a 
sample (e.g., biological sample) using a monoclonal or polyclonal antibody. The 
antibody can measure polypeptide level or activity by either binding, directly or 
indirectly, to the polypeptide or by competing with the polypeptide for a substrate. 

All of these above assays can be used as diagnostic or prognostic markers. 
The molecules discovered using these assays can be used to treat disease or to bring 
about a particular result in a patient (e.g., blood vessel growth) by activating or 
inhibiting the polypeptide/molecule. Moreover, the assays can discover agents which 
may inhibit or enhance the production of the polypeptide from suitably manipulated 
cells or tissues. 

Therefore, the invention includes a method of identifying compounds which 
bind to a polypeptide of the invention comprising the steps of: (a) incubating a 
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candidate binding compound with a polypeptide of the invention; and (b) determining 
if binding has occurred. Moreover, the invention includes a method of identifying 
agonists/antagonists comprising the steps of: (a) incubating a candidate compound 
with a polypeptide of the invention, (b) assaying a biological activity , and (b) 
5 determining if a biological activity of the polypeptide has been altered. 



Other Activities 

A polypeptide or polynucleotide of the present invention may also increase or 
decrease the differentiation or proliferation of embryonic stem cells, besides, as 

10 discussed above, hematopoietic lineage. 

A polypeptide or polynucleotide of the present invention may also be used to 
modulate mammalian characteristics, such as body height, weight, hair color, eye 
color, skin, percentage of adipose tissue, pigmentation, size, and shape (e.g., cosmetic 
surgery). Similarly, a polypeptide or polynucleotide of the present invention may be 

15 used to modulate mammalian metabolism affecting catabolism, anabolism, 
processing, utilization, and storage of energy. 

A polypeptide or polynucleotide of the present invention may be used to 
change a mammal's mental state or physical state by influencing biorhythms, 
caricadic rhythms, depression (including depressive disorders), tendency for violence, 

20 tolerance for pain, reproductive capabilities (preferably by Activin or Inhibin-like 
activity), hormonal or endocrine levels, appetite, libido, memory, stress, or other 
cognitive qualities. 

A polypeptide or polynucleotide of the present invention may also be used as a 
food additive or preservative, such as to increase or decrease storage capabilities, fat 
25 content, lipid, protein, carbohydrate, vitamins, minerals, cofactors or other nutritional 
components. 



Other Preferred Embodiments 

Other preferred embodiments of the claimed invention include an isolated 
30 nucleic acid molecule comprising a nucleotide sequence which is at least 95% 
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identical to a sequence of at least about 50 contiguous nucleotides in the nucleotide 
sequence of SEQ ID NO:X wherein X is any integer as defined in Table 1. 

Also preferred is a nucleic acid molecule wherein said sequence of contiguous 
nucleotides is included in the nucleotide sequence of SEQ ID NO:X in the range of 
positions beginning with the nucleotide at about the position of the 5' Nucleotide of 
the Clone Sequence and ending with the nucleotide at about the position of the 3' 
Nucleotide of the Clone Sequence as defined for SEQ ID NO:X in Table L 

Also preferred is a nucleic acid molecule wherein said sequence of contiguous 
nucleotides is included in the nucleotide sequence of SEQ ID NO:X in the range of 
positions beginning with the nucleotide at about the position of the 5' Nucleotide of 
the Start Codon and ending with the nucleotide at about the position of the 3' 
Nucleotide of the Clone Sequence as defined for SEQ ID NO:X in Table 1. 

Similarly preferred is a nucleic acid molecule wherein said sequence of 
contiguous nucleotides is included in the nucleotide sequence of SEQ ID NO:X in the 
range of positions beginning with the nucleotide at about the position of the 5' 
Nucleotide of the First Amino Acid of the Signal Peptide and ending with the 
nucleotide at about the position of the 3' Nucleotide of the Clone Sequence as defined 
for SEQ ID NO:X in Table 1. 

Also preferred is an isolated nucleic acid molecule comprising a nucleotide 
sequence which is at least 95% identical to a sequence of at least about 150 
contiguous nucleotides in the nucleotide sequence of SEQ ID NO:X. 

Further preferred is an isolated nucleic acid molecule comprising a nucleotide 
sequence which is at least 95% identical to a sequence of at least about 500 
contiguous nucleotides in the nucleotide sequence of SEQ ID NO:X. 

A further preferred embodiment is a nucleic acid molecule comprising a 
nucleotide sequence which is at least 95% identical to the nucleotide sequence of SEQ 
ID NO:X beginning with the nucleotide at about the position of the 5' Nucleotide of 
the First Amino Acid of the Signal Peptide and ending with the nucleotide at about 
the position of the 3' Nucleotide of the Clone Sequence as defined for SEQ ID NO:X 
in Table 1 . 
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A further preferred embodiment is an isolated nucleic acid molecule 
comprising a nucleotide sequence which is at least 95% identical to the complete 
nucleotide sequence of SEQ ID NO:X. 

Also preferred is an isolated nucleic acid molecule which hybridizes under 
5 stringent hybridization conditions to a nucleic acid molecule, wherein said nucleic 
acid molecule which hybridizes does not hybridize under stringent hybridization 
conditions to a nucleic acid molecule having a nucleotide sequence consisting of only 
A residues or of only T residues. 

Also preferred is a composition of matter comprising a DNA molecule which 
10 comprises a human cDNA clone identified by a cDNA Clone Identifier in Table 1, 
which DNA molecule is contained in the material deposited with the American Type 
Culture Collection and given the ATCC Deposit Number shown in Table 1 for said 
cDNA Clone Identifier. 

Also preferred is an isolated nucleic acid molecule comprising a nucleotide 
15 sequence which is at least 95% identical to a sequence of at least 50 contiguous 

nucleotides in the nucleotide sequence of a human cDNA clone identified by a cDNA 
Clone Identifier in Table 1 , which DNA molecule is contained in the deposit given the 
ATCC Deposit Number shown in Table 1 . 

Also preferred is an isolated nucleic acid molecule, wherein said sequence of 
20 at least 50 contiguous nucleotides is included in the nucleotide sequence of the 
complete open reading frame sequence encoded by said human cDNA clone. 

Also preferred is an isolated nucleic acid molecule comprising a nucleotide 
sequence which is at least 95% identical to sequence of at least 150 contiguous 
nucleotides in the nucleotide sequence encoded by said human cDNA clone. 
25 A further preferred embodiment is an isolated nucleic acid molecule 

comprising a nucleotide sequence which is at least 95% identical to sequence of at 
least 500 contiguous nucleotides in the nucleotide sequence encoded by said human 
cDNA clone. 

A further preferred embodiment is an isolated nucleic acid molecule 
30 comprising a nucleotide sequence which is at least 95% identical to the complete 
nucleotide sequence encoded by said human cDNA clone. 
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A further preferred embodiment is a method for detecting in a biological 
sample a nucleic acid molecule comprising a nucleotide sequence which is at least 
95% identical to a sequence of at least 50 contiguous nucleotides in a sequence 
selected from the group consisting of: a nucleotide sequence of SEQ ID NO:X 
5 wherein X is any integer as defined in Table 1 ; and a nucleotide sequence encoded by 
a human cDNA clone identified by a cDNA Clone Identifier in Table 1 and contained 
in the deposit with the ATCC Deposit Number shown for said cDNA clone in Table 
1 ; which method comprises a step of comparing a nucleotide sequence of at least one 
nucleic acid molecule in said sample with a sequence selected from said group and 

10 determining whether the sequence of said nucleic acid molecule in said sample is at 
least 95% identical to said selected sequence. 

Also preferred is the above method wherein said step of comparing sequences 
comprises determining the extent of nucleic acid hybridization between nucleic acid 
molecules in said sample and a nucleic acid molecule comprising said sequence 

15 selected from said group. Similarly, also preferred is the above method wherein said 
step of comparing sequences is performed by comparing the nucleotide sequence 
determined from a nucleic acid molecule in said sample with said sequence selected 
from said group. The nucleic acid molecules can comprise DNA molecules or RNA 
molecules. 

20 A further preferred embodiment is a method for identifying the species, tissue 

or cell type of a biological sample which method comprises a step of detecting nucleic 
acid molecules in said sample, if any, comprising a nucleotide sequence that is at least 
95% identical to a sequence of at least 50 contiguous nucleotides in a sequence 
selected from the group consisting of: a nucleotide sequence of SEQ ID NO:X 

25 wherein X is any integer as defined in Table 1 ; and a nucleotide sequence encoded by 
a human cDNA clone identified by a cDNA Clone Identifier in Table 1 and contained 
in the deposit with the ATCC Deposit Number shown for said cDNA clone in Table 
1. 

The method for identifying the species, tissue or cell type of a biological 
30 sample can comprise a step of detecting nucleic acid molecules comprising a 

nucleotide sequence in a panel of at least two nucleotide sequences, wherein at least 
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one sequence in said panel is at least 95% identical to a sequence of at least 50 

contiguous nucleotides in a sequence selected from said group. 

Also preferred is a method for diagnosing in a subject a pathological condition 

associated with abnormal structure or expression of a gene encoding a secreted 
5 protein identified in Table 1, which method comprises a step of detecting in a 

biological sample obtained from said subject nucleic acid molecules, if any, 

comprising a nucleotide sequence that is at least 95% identical to a sequence of at 

least 50 contiguous nucleotides in a sequence selected from the group consisting of: a 

nucleotide sequence of SEQ ID NO:X wherein X is any integer as defined in Table 1; 
10 and a nucleotide sequence encoded by a human cDNA clone identified by a cDNA 

Clone Identifier in Table 1 and contained in the deposit with the ATCC Deposit 

Number shown for said cDNA clone in Table 1 . 

The method for diagnosing a pathological condition can comprise a step of 

detecting nucleic acid molecules comprising a nucleotide sequence in a panel of at 
15 least two nucleotide sequences, wherein at least one sequence in said panel is at least 

95% identical to a sequence of at least 50 contiguous nucleotides in a sequence 

selected from said group. 

Also preferred is a composition of matter comprising isolated nucleic acid 

molecules wherein the nucleotide sequences of said nucleic acid molecules comprise 
20 a panel of at least two nucleotide sequences, wherein at least one sequence in said 

panel is at least 95% identical to a sequence of at least 50 contiguous nucleotides in a 

sequence selected from the group consisting of: a nucleotide sequence of SEQ ID 

NO:X wherein X is any integer as defined in Table 1; and a nucleotide sequence 

encoded by a human cDNA clone identified by a cDNA Clone Identifier in Table 1 
25 and contained in the deposit with the ATCC Deposit Number shown for said cDNA 

clone in Table 1 . The nucleic acid molecules can comprise DN A molecules or RNA 

molecules. 

Also preferred is an isolated polypeptide comprising an amino acid sequence 
at least 90% identical to a sequence of at least about 10 contiguous amino acids in the 
30 amino acid sequence of SEQ ID NO: Y wherein Y is any integer as defined in Table 1 . 



BNSDOCID: <WO 9947540A1_I_> 



WO 99/47540 PCT/US99/05804 

228 

Also preferred is a polypeptide, wherein said sequence of contiguous amino 
acids is included in the amino acid sequence of SEQ ID NO: Y in the range of 
positions beginning with the residue at about the position of the First Amino Acid of 
the Secreted Portion and ending with the residue at about the Last Amino Acid of the 
5 Open Reading Frame as set forth for SEQ ID NO:Y in Table 1. 

Also preferred is an isolated polypeptide comprising an amino acid sequence 
at least 95% identical to a sequence of at least about 30 contiguous amino acids in the 
amino acid sequence of SEQ ID NO: Y. 

Further preferred is an isolated polypeptide comprising an amino acid 
10 sequence at least 95% identical to a sequence of at least about 100 contiguous amino 
acids in the amino acid sequence of SEQ ID NO: Y. 

Further preferred is an isolated polypeptide comprising an amino acid 
sequence at least 95% identical to the complete amino acid sequence of SEQ ID 
NO:Y. 

15 Further preferred is an isolated polypeptide comprising an amino acid 

sequence at least 90% identical to a sequence of at least about 10 contiguous amino 
acids in the complete amino acid sequence of a secreted protein encoded by a human 
cDNA clone identified by a cDNA Clone Identifier in Table 1 and contained in the 
deposit with the ATCC Deposit Number shown for said cDNA clone in Table 1. 

20 Also preferred is a polypeptide wherein said sequence of contiguous amino 

acids is included in the amino acid sequence of a secreted portion of the secreted 
protein encoded by a human cDNA clone identified by a cDNA Clone Identifier in 
Table 1 and contained in the deposit with the ATCC Deposit Number shown for said 
cDNA clone in Table 1. 

25 Also preferred is an isolated polypeptide comprising an amino acid sequence 

at least 95% identical to a sequence of at least about 30 contiguous amino acids in the 
amino acid sequence of the secreted portion of the protein encoded by a human cDNA 
clone identified by a cDNA Clone Identifier in Table 1 and contained in the deposit 
with the ATCC Deposit Number shown for said cDNA clone in Table 1. 

30 Also preferred is an isolated polypeptide comprising an amino acid sequence 

at least 95% identical to a sequence of at least about 100 contiguous amino acids in 
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the amino acid sequence of the secreted portion of the protein encoded by a human 
cDNA clone identified by a cDNA Clone Identifier in Table 1 and contained in the 
deposit with the ATCC Deposit Number shown for said cDNA clone in Table 1. 

Also preferred is an isolated polypeptide comprising an amino acid sequence 
5 at least 95% identical to the amino acid sequence of the secreted portion of the protein 
encoded by a human cDNA clone identified by a cDNA Clone Identifier in Table 1 
and contained in the deposit with the ATCC Deposit Number shown for said cDNA 
clone in Table 1 . 

Further preferred is an isolated antibody which binds specifically to a 
10 polypeptide comprising an amino acid sequence that is at least 90% identical to a 

sequence of at least 10 contiguous amino acids in a sequence selected from the group 
consisting of: an amino acid sequence of SEQ ID NO: Y wherein Y is any integer as 
defined in Table 1 ; and a complete amino acid sequence of a protein encoded by a 
human cDNA clone identified by a cDNA Clone Identifier in Table 1 and contained 
15 in the deposit with the ATCC Deposit Number shown for said cDNA clone in Table 
1. 

Further preferred is a method for detecting in a biological sample a 
polypeptide comprising an amino acid sequence which is at least 90% identical to a 
sequence of at least 10 contiguous amino acids in a sequence selected from the group 

20 consisting of: an amino acid sequence of SEQ ID NO: Y wherein Y is any integer as 
defined in Table 1 ; and a complete amino acid sequence of a protein encoded by a 
human cDNA clone identified by a cDNA Clone Identifier in Table 1 and contained 
in the deposit with the ATCC Deposit Number shown for said cDNA clone in Table 
1 ; which method comprises a step of comparing an amino acid sequence of at least 

25 one polypeptide molecule in said sample with a sequence selected from said group 
and determining whether the sequence of said polypeptide molecule in said sample is 
at least 90% identical to said sequence of at least 10 contiguous amino acids. 

Also preferred is the above method wherein said step of comparing an amino 
acid sequence of at least one polypeptide molecule in said sample with a sequence 

30 selected from said group comprises determining the extent of specific binding of 

polypeptides in said sample to an antibody which binds specifically to a polypeptide 
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comprising an amino acid sequence that is at least 90% identical to a sequence of at 
least 10 contiguous amino acids in a sequence selected from the group consisting of: 
an amino acid sequence of SEQ ID NO: Y wherein Y is any integer as defined in 
Table 1; and a complete amino acid sequence of a protein encoded by a human cDNA 
5 clone identified by a cDNA Clone Identifier in Table 1 and contained in the deposit 
with the ATCC Deposit Number shown for said cDNA clone in Table 1. 

Also preferred is the above method wherein said step of comparing sequences 
is performed by comparing the amino acid sequence determined from a polypeptide 
molecule in said sample with said sequence selected from said group. 
10 Also preferred is a method for identifying the species, tissue or cell type of a 

biological sample which method comprises a step of detecting polypeptide molecules 
in said sample, if any, comprising an amino acid sequence that is at least 90% 
identical to a sequence of at least 10 contiguous amino acids in a sequence selected 
from the group consisting of: an amino acid sequence of SEQ ID NO: Y wherein Y is 
15 any integer as defined in Table 1; and a complete amino acid sequence of a secreted 
protein encoded by a human cDNA clone identified by a cDNA Clone Identifier in 
Table 1 and contained in the deposit with the ATCC Deposit Number shown for said 
cDNA clone in Table 1. 

Also preferred is the above method for identifying the species, tissue or cell 
20 type of a biological sample, which method comprises a step of detecting polypeptide 
molecules comprising an amino acid sequence in a panel of at least two amino acid 
sequences, wherein at least one sequence in said panel is at least 90% identical to a 
sequence of at least 10 contiguous amino acids in a sequence selected from the above 
group. 

25 Also preferred is a method for diagnosing in a subject a pathological condition 

associated with abnormal structure or expression of a gene encoding a secreted 
protein identified in Table 1, which method comprises a step of detecting in a 
biological sample obtained from said subject polypeptide molecules comprising an 
amino acid sequence in a panel of at least two amino acid sequences, wherein at least 

30 one sequence in said panel is at least 90% identical to a sequence of at least 10 

contiguous amino acids in a sequence selected from the group consisting of: an amino 
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acid sequence of SEQ ID NO:Y wherein Y is any integer as defined in Table 1; and a 
complete amino acid sequence of a secreted protein encoded by a human cDN A clone 
identified by a cDN A Clone Identifier in Table 1 and contained in the deposit with the 
ATCC Deposit Number shown for said cDNA clone in Table 1. 
5 In any of these methods, the step of detecting said polypeptide molecules 

includes using an antibody. 

Also preferred is an isolated nucleic acid molecule comprising a nucleotide 
sequence which is at least 95% identical to a nucleotide sequence encoding a 
polypeptide wherein said polypeptide comprises an amino acid sequence that is at 

10 least 90% identical to a sequence of at least 10 contiguous amino acids in a sequence 
selected from the group consisting of: an amino acid sequence of SEQ ID NO: Y 
wherein Y is any integer as defined in Table 1 ; and a complete amino acid sequence 
of a secreted protein encoded by a human cDNA clone identified by a cDNA Clone 
Identifier in Table 1 and contained in the deposit with the ATCC Deposit Number 

1 5 shown for said cDN A clone in Table 1 . 

Also preferred is an isolated nucleic acid molecule, wherein said nucleotide 
sequence encoding a polypeptide has been optimized for expression of said 
polypeptide in a prokaryotic host; 

Also preferred is an isolated nucleic acid molecule, wherein said polypeptide 

20 comprises an amino acid sequence selected from the group consisting of: an amino 
acid sequence of SEQ ID NO:Y wherein Y is any integer as defined in Table 1; and a 
complete amino acid sequence of a secreted protein encoded by a human cDNA clone 
identified by a cDNA Clone Identifier in Table 1 and contained in the deposit with the 
ATCC Deposit Number shown for said cDNA clone in Table 1 . 

25 Further preferred is a method of making a recombinant vector comprising 

inserting any of the above isolated nucleic acid molecule into a vector. Also preferred 
is the recombinant vector produced by this method. Also preferred is a method of 
making a recombinant host cell comprising introducing the vector into a host cell, as 
well as the recombinant host cell produced by this method. 

30 Also preferred is a method of making an isolated polypeptide comprising 

culturing this recombinant host cell under conditions such that said polypeptide is 
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expressed and recovering said polypeptide. Also preferred is this method of making 
an isolated polypeptide, wherein said recombinant host cell is a eukaryotic cell and 
said polypeptide is a secreted portion of a human secreted protein comprising an 
amino acid sequence selected from the group consisting of: an amino acid sequence of 
SEQ ID NO: Y beginning with the residue at the position of the First Amino Acid of 
the Secreted Portion of SEQ ID NO: Y wherein Y is an integer set forth in Table 1 and 
said position of the First Amino Acid of the Secreted Portion of SEQ ID NO: Y is 
defined in Table 1 ; and an amino acid sequence of a secreted portion of a protein 
encoded by a human cDNA clone identified by a cDNA Clone Identifier in Table 1 
and contained in the deposit with the ATCC Deposit Number shown for said cDNA 
clone in Table 1 . The isolated polypeptide produced by this method is also preferred. 

Also preferred is a method of treatment of an individual in need of an 
increased level of a secreted protein activity, which method comprises administering 
to such an individual a pharmaceutical composition comprising an amount of an 
isolated polypeptide, polynucleotide, or antibody of the claimed invention effective to 
increase the level of said protein activity in said individual. 

Having generally described the invention, the same will be more readily 
understood by reference to the following examples, which are provided by way of 
illustration and are not intended as limiting. 

Examples 

Example 1: Isolation of a Selecte d cDNA Clone From the Deposited Samp le 

Each cDNA clone in a cited ATCC deposit is contained in a plasmid vector. 
Table 1 identifies the vectors used to construct the cDNA library from which each 
clone was isolated. In many cases, the vector used to construct the library is a phage 
vector from which a plasmid has been excised. The table immediately below 
correlates the related plasmid for each phage vector used in constructing the cDNA 
library. For example, where a particular clone is identified in Table 1 as being 
isolated in the vector "Lambda Zap," the corresponding deposited clone is in 
"pBluescript." 
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Vector Used to Construct Library 



Corresponding Deposited 



Plasmid 



pSportl 

pCMVSport 2.0 
pCMVSport 3.0 
pCR®2.1 



Lambda Zap 
Uni-Zap XR 
Zap Express 



lafmid BA 



pBluescript (pBS) 
pBluescript (pBS) 
pBK 

plafmid BA 
pSportl 

pCMVSport 2.0 
pCMVSport 3.0 



10 



15 



20 



25 



Vectors Lambda Zap (U.S. Patent Nos. 5,128,256 and 5,286,636), Uni-Zap 
XR (U.S. Patent Nos. 5,128, 256 and 5,286,636), Zap Express (U.S. Patent Nos. 
5,128,256 and 5,286,636), pBluescript (pBS) (Short, J. M. et al., Nucleic Acids Res. 
16:7583-7600 (1988); Alting-Mees, M. A. and Short, J. M., Nucleic Acids Res. 
17:9494 (1989)) and pBK (Alting-Mees, M. A. et al., Strategies 5:58-61 (1992)) are 
commercially available from Stratagene Cloning Systems, Inc., 1 101 1 N. Torrey 
Pines Road, La Jolla, CA, 92037. pBS contains an ampicillin resistance gene and 
pBK contains a neomycin resistance gene. Both can be transformed into E. coli strain 
XL-1 Blue, also available from Stratagene. pBS comes in 4 forms SK+, SK-, KS+ 
and KS. The S and K refers to the orientation of the poly linker to the T7 and T3 
primer sequences which flank the poly linker region ("S" is for SacI and "K" is for 
Kpnl which are the first sites on each respective end of the linker). "+" or refer to 
the orientation of the f 1 origin of replication ("ori"), such that in one orientation, 
single stranded rescue initiated from the f 1 ori generates sense strand DNA and in the 
other, antisense. 

Vectors pSportl, pCMVSport 2.0 and pCMVSport 3.0, were obtained from 
Life Technologies, Inc., P. O. Box 6009, Gaithersburg, MD 20897. All Sport vectors 
contain an ampicillin resistance gene and may be transformed into E. coli strain 
DH10B, also available from Life Technologies. (See, for instance, Gruber, C. E., et 
al., Focus 15:59 (1993).) Vector lafmid BA (Bento Soares, Columbia University, 
NY) contains an ampicillin resistance gene and can be transformed into E. coli strain 
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XL-1 Blue. Vector pCR®2.1, which is available from Invitrogen, 1600 Faraday 
Avenue, Carlsbad, CA 92008, contains an ampicillin resistance gene and may be 
transformed into E. coli strain DH10B, available from Life Technologies. (See, for 
instance, Clark, J. M., Nuc. Acids Res. 16:9677-9686 (1988) and Mead, D. et al., 
5 Biotechnology 9: (1991).) Preferably, a polynucleotide of the present invention 
does not comprise the phage vector sequences identified for the particular clone in 
Table 1, as well as the corresponding plasmid vector sequences designated above. 

The deposited material in the sample assigned the ATCC Deposit Number 
cited in Table 1 for any given cDNA clone also may contain one or more additional 
10 plasmids, each comprising a cDNA clone different from that given clone. Thus, 

deposits sharing the same ATCC Deposit Number contain at least a plasmid for each 
cDNA clone identified in Table 1. Typically, each ATCC deposit sample cited in 
Table 1 comprises a mixture of approximately equal amounts (by weight) of about 50 
plasmid DNAs, each containing a different cDNA clone; but such a deposit sample 
15 may include plasmids for more or less than 50 cDNA clones, up to about 500 cDNA 
clones. 

Two approaches can be used to isolate a particular clone from the deposited 
sample of plasmid DNAs cited for that clone in Table 1. First, a plasmid is directly 
isolated by screening the clones using a polynucleotide probe corresponding to SEQ 
20 ID NO:X. 

Particularly, a specific polynucleotide with 30-40 nucleotides is synthesized 
using an Applied Biosystems DNA synthesizer according to the sequence reported. 
The oligonucleotide is labeled, for instance, with 32 P-y-ATP using T4 polynucleotide 
kinase and purified according to routine methods. (E.g., Maniatis et al., Molecular 

25 Cloning: A Laboratory Manual, Cold Spring Harbor Press, Cold Spring, NY (1982).) 
The plasmid mixture is transformed into a suitable host, as indicated above (such as 
XL-1 Blue (Stratagene)) using techniques known to those of skill in the art, such as 
those provided by the vector supplier or in related publications or patents cited above. 
The transformants are plated on 1.5% agar plates (containing the appropriate selection 

30 agent, e.g., ampicillin) to a density of about 150 transformants (colonies) per plate. 
These plates are screened using Nylon membranes according to routine methods for 
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bacterial colony screening (e.g., Sambrook et al., Molecular Cloning: A Laboratory 
Manual, 2nd Edit., (1989), Cold Spring Harbor Laboratory Press, pages 1.93 to 
1.104), or other techniques known to those of skill in the art. 

Alternatively, two primers of 17-20 nucleotides derived from both ends of the 
5 SEQ ID NO:X (i.e., within the region of SEQ ID NO:X bounded by the 5' NT and the 
3' NT of the clone defined in Table 1) are synthesized and used to amplify the desired 
cDNA using the deposited cDNA plasmid as a template. The polymerase chain 
reaction is carried out under routine conditions, for instance, in 25 \il of reaction 
mixture with 0.5 ug of the above cDNA template. A convenient reaction mixture is 

10 1 .5-5 mM MgCl 2 , 0.01 % (w/v) gelatin, 20 |iM each of dATP, dCTP, dGTP, dTTP, 25 
pmol of each primer and 0.25 Unit of Taq polymerase. Thirty five cycles of PCR 
(denaturation at 94°C for 1 min; annealing at 55°C for 1 min; elongation at 72°C for 1 
min) are performed with a Perkin-Elmer Cetus automated thermal cycler. The 
amplified product is analyzed by agarose gel electrophoresis and the DNA band with 

15 expected molecular weight is excised and purified. The PCR product is verified to be 
the selected sequence by subcloning and sequencing the DNA product. 

Several methods are available for the identification of the 5' or 3' non-coding 
portions of a gene which may not be present in the deposited clone. These methods 
include but are not limited to, filter probing, clone enrichment using specific probes, 

20 and protocols similar or identical to 5' and 3' "RACE" protocols which are well 
known in the art. For instance, a method similar to 5* RACE is available for 
generating the missing 5' end of a desired full-length transcript. (Fromont-Racine et 
al., Nucleic Acids Res. 21(7): 1683-1684 (1993).) 

Briefly, a specific RNA oligonucleotide is ligated to the 5' ends of a 

25 population of RNA presumably containing full-length gene RNA transcripts. A 
primer set containing a primer specific to the ligated RNA oligonucleotide and a 
primer specific to a known sequence of the gene of interest is used to PCR amplify 
the 5' portion of the desired full-length gene. This amplified product may then be 
sequenced and used to generate the full length gene. 

30 This above method starts with total RNA isolated from the desired source, 

although poly-A+ RNA can be used. The RNA preparation can then be treated with 
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phosphatase if necessary to eliminate 5' phosphate groups on degraded or damaged 
RNA which may interfere with the later RNA ligase step. The phosphatase should 
then be inactivated and the RNA treated with tobacco acid pyrophosphatase in order 
to remove the cap structure present at the 5' ends of messenger RNAs. This reaction 
leaves a 5' phosphate group at the 5' end of the cap cleaved RNA which can then be 
ligated to an RNA oligonucleotide using T4 RNA ligase. 

This modified RNA preparation is used as a template for first strand cDNA 
synthesis using a gene specific oligonucleotide. The first strand synthesis reaction is 
used as a template for PCR amplification of the desired 5' end using a primer specific 
to the ligated RNA oligonucleotide and a primer specific to the known sequence of 
the gene of interest. The resultant product is then sequenced and analyzed to confirm 
that the 5' end sequence belongs to the desired gene. 

Example 2: Isolation of Geno mic Clones Corresponding to a Polynucleotide 

A human genomic PI library (Genomic Systems, Inc.) is screened by PCR 
using primers selected for the cDNA sequence corresponding to SEQ ID NO:X., 
according to the method described in Example 1. (See also, Sambrook.) 

Example 3: Tissue Distribution of Polypeptide 

Tissue distribution of mRNA expression of polynucleotides of the present 
invention is determined using protocols for Northern blot analysis, described by, 
among others, Sambrook et al. For example, a cDNA probe produced by the method 
described in Example 1 is labeled with P 32 using the rediprime™ DNA labeling 
system (Amersham Life Science), according to manufacturer's instructions. After 
labeling, the probe is purified using CHROMA SPIN- 100™ column (Clontech 
Laboratories, Inc.), according to manufacturer's protocol number PT1200-1. The 
purified labeled probe is then used to examine various human tissues for mRNA 
expression. 

Multiple Tissue Northern (MTN) blots containing various human tissues (H) 
or human immune system tissues (IM) (Clontech) are examined with the labeled 
probe using ExpressHyb™ hybridization solution (Clontech) according to 
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manufacturer's protocol number PT1 190-1. Following hybridization and washing, the 
blots are mounted and exposed to film at -70°C overnight, and the films developed 
according to standard procedures. 

5 Example 4: Chromosomal Mapp in g of the Polynucleotides 

An oligonucleotide primer set is designed according to the sequence at the 5* 
end of SEQ ID NO:X. This primer preferably spans about 100 nucleotides. This 
primer set is then used in a polymerase chain reaction under the following set of 
conditions : 30 seconds, 95°C; 1 minute, 56°C; 1 minute, 70°C. This cycle is 

10 repeated 32 times followed by one 5 minute cycle at 70°C. Human, mouse, and 

hamster DNA is used as template in addition to a somatic cell hybrid panel containing 
individual chromosomes or chromosome fragments (Bios, Inc). The reactions is 
analyzed on either 8% polyacrylamide gels or 3.5 % agarose gels. Chromosome 
mapping is determined by the presence of an approximately 100 bp PCR fragment in 

15 the particular somatic cell hybrid. 

Example 5; Bacterial Expression of a Polypeptide 

A polynucleotide encoding a polypeptide of the present invention is amplified 
using PCR oligonucleotide primers corresponding to the 5' and 3' ends of the DNA 
20 sequence, as outlined in Example 1, to synthesize insertion fragments. The primers 
used to amplify the cDNA insert should preferably contain restriction sites, such as 
BamHI and Xbal, at the 5* end of the primers in order to clone the amplified product 
into the expression vector. For example, BamHI and Xbal correspond to the 
restriction enzyme sites on the bacterial expression vector pQE-9. (Qiagen, Inc., 

25 Chatsworth, CA). This plasmid vector encodes antibiotic resistance (Amp r )» a 

bacterial origin of replication (ori), an IPTG-regulatable promoter/operator (P/O), a 
ribosome binding site (RBS), a 6-histidine tag (6-His), and restriction enzyme cloning 
sites. 

The pQE-9 vector is digested with BamHI and Xbal and the amplified 
30 fragment is ligated into the pQE-9 vector maintaining the reading frame initiated at 
the bacterial RBS. The ligation mixture is then used to transform the E. coli strain 
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M15/rep4 (Qiagen, Inc.) which contains multiple copies of the plasmid pREP4, which 
expresses the lad repressor and also confers kanamycin resistance (Kan r ). 
Transformants are identified by their ability to grow on LB plates and 
ampicillin/kanamycin resistant colonies are selected. Plasmid DNA is isolated and 
5 confirmed by restriction analysis. 

Clones containing the desired constructs are grown overnight (O/N) in liquid 
culture in LB media supplemented with both Amp (100 ug/ml) and Kan (25 ug/ml). 
The O/N culture is used to inoculate a large culture at a ratio of 1 : 100 to 1 :250. The 
cells are grown to an optical density 600 (O.D. 600 ) of between 0.4 and 0.6. IPTG 
10 (Isopropyl-B-D-thiogalacto pyranoside) is then added to a final concentration of 1 
mM. IPTG induces by inactivating the lad repressor, clearing the P/O leading to 
increased gene expression. 

Cells are grown for an extra 3 to 4 hours. Cells are then harvested by 
centrifugation (20 mins at 6000Xg). The cell pellet is solubilized in the chaotropic 
15 agent 6 Molar Guanidine HC1 by stirring for 3-4 hours at 4°C. The cell debris is 

removed by centrifugation, and the supernatant containing the polypeptide is loaded 
onto a nickel-nitrilo-tri-acetic acid ("Ni-NTA") affinity resin column (available from 
QIAGEN, Inc., supra). Proteins with a 6 x His tag bind to the Ni-NTA resin with 
high affinity and can be purified in a simple one-step procedure (for details see: The 
20 QIAexpressionist (1995) QIAGEN, Inc., supra). 

Briefly, the supernatant is loaded onto the column in 6 M guanidine-HCl, pH 
8, the column is first washed with 10 volumes of 6 M guanidine-HCl, pH 8, then 
washed with 10 volumes of 6 M guanidine-HCl pH 6, and finally the polypeptide is 
eluted with 6 M guanidine-HCl, pH 5. 
25 The purified protein is then renatured by dialyzing it against phosphate- 

buffered saline (PBS) or 50 mM Na-acetate, pH 6 buffer plus 200 mM NaCl. 
Alternatively, the protein can be successfully refolded while immobilized on the Ni- 
NTA column. The recommended conditions are as follows: renature using a linear 
6M-1M urea gradient in 500 mM NaCl, 20% glycerol, 20 mM Tris/HCl pH 7.4, 
30 containing protease inhibitors. The renaturation should be performed over a period of 
1.5 hours or more. After renaturation the proteins are eluted by the addition of 250 
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mM immidazole. Immidazole is removed by a final dialyzing step against PBS or 50 
mM sodium acetate pH 6 buffer plus 200 mM NaCl. The purified protein is stored at 
4° Cor frozen at -80° C. 

In addition to the above expression vector, the present invention further 
5 includes an expression vector comprising phage operator and promoter elements 

operatively linked to a polynucleotide of the present invention, called pHE4a. (ATCC 
Accession Number 209645, deposited on February 25, 1998.) This vector contains: 
1) a neomycinphosphotransferase gene as a selection marker, 2) an E. coli origin of 
replication, 3) a T5 phage promoter sequence, 4) two lac operator sequences, 5) a 

10 Shine-Delgarno sequence, and 6) the lactose operon repressor gene (laclq). The 
origin of replication (oriC) is derived from pUC19 (LTI, Gaithersburg, MD). The 
promoter sequence and operator sequences are made synthetically. 

DNA can be inserted into the pHEa by restricting the vector with Ndel and 
Xbal, BamHI, Xhol, or Asp718, running the restricted product on a gel, and isolating 

15 the larger fragment (the stuffer fragment should be about 310 base pairs). The DNA 
insert is generated according to the PCR protocol described in Example 1, using PCR 
primers having restriction sites for Ndel (5' primer) and Xbal, BamHI, Xhol, or 
Asp718 (3' primer). The PCR insert is gel purified and restricted with compatible 
enzymes. The insert and vector are ligated according to standard protocols. 

20 The engineered vector could easily be substituted in the above protocol to 

express protein in a bacterial system. 

Example 6: Purification of a Polypeptide from an Inclusion Body 

The following alternative method can be used to purify a polypeptide 
25 expressed in E coli when it is present in the form of inclusion bodies. Unless 
otherwise specified, all of the following steps are conducted at 4-10°C. 

Upon completion of the production phase of the E. coli fermentation, the cell 
culture is cooled to 4-10°C and the cells harvested by continuous centrifugation at 
15,000 rpm (Heraeus Sepatech). On the basis of the expected yield of protein per unit 
30 weight of cell paste and the amount of purified protein required, an appropriate 

amount of cell paste, by weight, is suspended in a buffer solution containing 100 mM 
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Tris, 50 mM EDTA, pH 7.4. The cells are dispersed to a homogeneous suspension 
using a high shear mixer. 

The cells are then lysed by passing the solution through a microfluidizer 
(Microfuidics, Corp. or APV Gaulin, Inc.) twice at 4000-6000 psi. The homogenate 
5 is then mixed with NaCl solution to a final concentration of 0.5 M NaCl, followed by 
centrifugation at 7000 xg for 15 min. The resultant pellet is washed again using 0.5M 
NaCl, 100 mM Tris, 50 mM EDTA, pH 7.4. 

The resulting washed inclusion bodies are solubilized with 1.5 M guanidine 
hydrochloride (GuHCl) for 2-4 hours. After 7000 xg centrifugation for 15 min., the 
10 pellet is discarded and the polypeptide containing supernatant is incubated at 4°C 
overnight to allow further GuHCl extraction. 

Following high speed centrifugation (30,000 xg) to remove insoluble particles, 
the GuHCl solubilized protein is refolded by quickly mixing the GuHCl extract with 
20 volumes of buffer containing 50 mM sodium, pH 4.5, 150 mM NaCl, 2 mM EDTA 
1 5 by vigorous stirring. The refolded diluted protein solution is kept at 4°C without 
mixing for 12 hours prior to further purification steps. 

To clarify the refolded polypeptide solution, a previously prepared tangential 
filtration unit equipped with 0.16 u.m membrane filter with appropriate surface area 
(e.g., Filtron), equilibrated with 40 mM sodium acetate, pH 6.0 is employed. The 
20 filtered sample is loaded onto a cation exchange resin (e.g., Poros HS-50, Perseptive 
Biosystems). The column is washed with 40 mM sodium acetate, pH 6.0 and eluted 
with 250 mM, 500 mM, 1000 mM, and 1500 mM NaCl in the same buffer, in a 
stepwise manner. The absorbance at 280 nm of the effluent is continuously 
monitored. Fractions are collected and further analyzed by SDS-PAGE. 
25 Fractions containing the polypeptide are then pooled and mixed with 4 

volumes of water. The diluted sample is then loaded onto a previously prepared set of 
tandem columns of strong anion (Poros HQ-50, Perseptive Biosystems) and weak 
anion (Poros CM-20, Perseptive Biosystems) exchange resins. The columns are 
equilibrated with 40 mM sodium acetate, pH 6.0. Both columns are washed with 40 
30 mM sodium acetate, pH 6.0, 200 mM NaCl. The CM-20 column is then eluted using 
a 10 column volume linear gradient ranging from 0.2 M NaCl, 50 mM sodium 
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acetate, pH 6.0 to 1.0 M NaCl, 50 mM sodium acetate, pH 6.5. Fractions are 
collected under constant A 280 monitoring of the effluent. Fractions containing the 
polypeptide (determined, for instance, by 16% SDS-PAGE) are then pooled. 

The resultant polypeptide should exhibit greater than 95% purity after the 
5 above refolding and purification steps. No major contaminant bands should be 

observed from Commassie blue stained 16% SDS-PAGE gel when 5 |j,g of purified 
protein is loaded. The purified protein can also be tested for endotoxin/LPS 
contamination, and typically the LPS content is less than 0. 1 ng/ml according to LAL 
assays. 

10 

Example 7: Cloning and Expression of a Polypeptide in a Baculovirus 
Expression System 

In this example, the plasmid shuttle vector pA2 is used to insert a 
polynucleotide into a baculovirus to express a polypeptide. This expression vector 

15 contains the strong polyhedrin promoter of the Autographa calif ornica nuclear 
polyhedrosis virus (AcMNPV) followed by convenient restriction sites such as 
BamHI, Xba I and Asp718. The polyadenylation site of the simian virus 40 ("SV40") 
is used for efficient polyadenylation. For easy selection of recombinant virus, the 
plasmid contains the beta-galactosidase gene from E. coli under control of a weak 

20 Drosophila promoter in the same orientation, followed by the polyadenylation signal 
of the polyhedrin gene. The inserted genes are flanked on both sides by viral 
sequences for cell-mediated homologous recombination with wild-type viral DNA to 
generate a viable virus that express the cloned polynucleotide. 

Many other baculovirus vectors can be used in place of the vector above, such 

25 as pAc373, pVL941, and pAcIMl, as one skilled in the art would readily appreciate, 
as long as the construct provides appropriately located signals for transcription, 
translation, secretion and the like, including a signal peptide and an in-frame AUG as 
required. Such vectors are described, for instance, in Luckow et al., Virology 170:31- 
39(1989). 

30 Specifically, the cDNA sequence contained in the deposited clone, including 

the AUG initiation codon and the naturally associated leader sequence identified in 
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Table 1, is amplified using the PCR protocol described in Example 1. If the naturally 
occurring signal sequence is used to produce the secreted protein, the pA2 vector does 
not need a second signal peptide. Alternatively, the vector can be modified (pA2 GP) 
to include a baculovirus leader sequence, using the standard methods described in 
5 Summers et ah, "A Manual of Methods for Baculovirus Vectors and Insect Cell 
Culture Procedures," Texas Agricultural Experimental Station Bulletin No. 1555 
(1987). 

The amplified fragment is isolated from a 1% agarose gel using a 
commercially available kit ("Geneclean," BIO 101 Inc., La Jolla, Ca.). The fragment 
10 then is digested with appropriate restriction enzymes and again purified on a 1% 
agarose gel. 

The plasmid is digested with the corresponding restriction enzymes and 
optionally, can be dephosphorylated using calf intestinal phosphatase, using routine 
procedures known in the art. The DN A is then isolated from a 1 % agarose gel using a 
15 commercially available kit ("Geneclean" BIO 101 Inc., La Jolla, Ca.). 

The fragment and the dephosphorylated plasmid are ligated together with T4 
DNA ligase. E. coli HB101 or other suitable E. coli hosts such as XL-1 Blue 
(Stratagene Cloning Systems, La Jolla, CA) cells are transformed with the ligation 
mixture and spread on culture plates. Bacteria containing the plasmid are identified 
20 by digesting DNA from individual colonies and analyzing the digestion product by 
gel electrophoresis. The sequence of the cloned fragment is confirmed by DNA 
sequencing. 

Five |ig of a plasmid containing the polynucleotide is co-transfected with 1 .0 
jig of a commercially available linearized baculovirus DNA ("BaculoGold™ 

25 baculovirus DNA", Pharmingen, San Diego, CA), using the lipofection method 

described by Feigner et al., Proc. Natl. Acad. Sci. USA 84:7413-7417 (1987). One ^ig 
of BaculoGold™ virus DNA and 5 p.g of the plasmid are mixed in a sterile well of a 
microtiter plate containing 50 \il of serum-free Grace's medium (Life Technologies 
Inc., Gaithersburg, MD). Afterwards, 10 fxl Lipofectin plus 90 fil Grace's medium are 

30 added, mixed and incubated for 15 minutes at room temperature. Then the 

transfection mixture is added drop- wise to Sf9 insect cells (ATCC CRL 1711) seeded 
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in a 35 mm tissue culture plate with 1 ml Grace's medium without serum. The plate is 
then incubated for 5 hours at 27° C. The transfection solution is then removed from 
the plate and 1 ml of Grace's insect medium supplemented with 10% fetal calf serum 
is added. Cultivation is then continued at 27° C for four days. 
5 After four days the supernatant is collected and a plaque assay is performed, 

as described by Summers and Smith, supra. An agarose gel with "Blue Gal" (Life 
Technologies Inc., Gaithersburg) is used to allow easy identification and isolation of 
gal-expressing clones, which produce blue-stained plaques. (A detailed description of 
a "plaque assay" of this type can also be found in the user's guide for insect cell 

10 culture and baculovirology distributed by Life Technologies Inc., Gaithersburg, page 
9-10.) After appropriate incubation, blue stained plaques are picked with the tip of a 
micropipettor (e.g., Eppendorf). The agar containing the recombinant viruses is then 
resuspended in a microcentrifuge tube containing 200 \\\ of Grace's medium and the 
suspension containing the recombinant baculovirus is used to infect Sf9 cells seeded 

15 in 35 mm dishes. Four days later the supernatants of these culture dishes are 
harvested and then they are stored at 4° C. 

To verify the expression of the polypeptide, Sf9 cells are grown in Grace's 
medium supplemented with 10% heat-inactivated FBS. The cells are infected with 
the recombinant baculovirus containing the polynucleotide at a multiplicity of 

20 infection ("MOI") of about 2. If radiolabeled proteins are desired, 6 hours later the 
medium is removed and is replaced with SF900 II medium minus methionine and 
cysteine (available from Life Technologies Inc., Rockville, MD). After 42 hours, 5 
|iCi of 35 S-methionine and 5 |iCi 35 S-cysteine (available from Amersham) are added. 
The cells are further incubated for 16 hours and then are harvested by centrifugation. 

25 The proteins in the supernatant as well as the intracellular proteins are analyzed by 
SDS-PAGE followed by autoradiography (if radiolabeled). 

Microsequencing of the amino acid sequence of the amino terminus of 
purified protein may be used to determine the amino terminal sequence of the 
produced protein. 

30 Example 8: Expression of a Polypeptide in Mammalian Cells 
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The polypeptide of the present invention can be expressed in a mammalian 
cell. A typical mammalian expression vector contains a promoter element, which 
mediates the initiation of transcription of mRNA, a protein coding sequence, and 
signals required for the termination of transcription and polyadenylation of the 
5 transcript. Additional elements include enhancers, Kozak sequences and intervening 
sequences flanked by donor and acceptor sites for RNA splicing. Highly efficient 
transcription is achieved with the early and late promoters from SV40, the long 
terminal repeats (LTRs) from Retroviruses, e.g., RS V, HTLVI, HIVI and the early 
promoter of the cytomegalovirus (CMV). However, cellular elements can also be 
10 used (e.g., the human actin promoter). 

Suitable expression vectors for use in practicing the present invention include, 
for example, vectors such as pS VL and pMSG (Pharmacia, Uppsala, Sweden), 
pRSVcat (ATCC 37152), pSV2dhfr (ATCC 37146), pBC12MI (ATCC 67109), 
pCMVSport 2.0, and pCMVSport 3.0. Mammalian host cells that could be used 
15 include, human Hela, 293, H9 and Jurkat cells, mouse NIH3T3 and C127 cells, Cos 1, 
Cos 7 and CV1, quail QC1-3 cells, mouse L cells and Chinese hamster ovary (CHO) 
cells. 

Alternatively, the polypeptide can be expressed in stable cell lines containing 
the polynucleotide integrated into a chromosome. The co-transfection with a 

20 selectable marker such as dhfr, gpt, neomycin, hygromycin allows the identification 
and isolation of the transfected cells. 

The transfected gene can also be amplified to express large amounts of the 
encoded protein. The DHFR (dihydrofolate reductase) marker is useful in developing 
cell lines that carry several hundred or even several thousand copies of the gene of 

25 interest. (See, e.g., Alt, F. W., et al., J. Biol. Chem. 253:1357-1370 (1978); Hamlin, J. 
L. and Ma, C, Biochem. et Biophys. Acta, 1097:107-143 (1990); Page, M. J. and 
Sydenham, M. A., Biotechnology 9:64-68 (1991).) Another useful selection marker 
is the enzyme glutamine synthase (GS) (Murphy et al., Biochem J. 227:277-279 
(1991); Bebbington et al., Bio/Technology 10:169-175 (1992). Using these markers, 

30 the mammalian cells are grown in selective medium and the cells with the highest 

resistance are selected. These cell lines contain the amplified gene(s) integrated into a 
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chromosome. Chinese hamster ovary (CHO) and NSO cells are often used for the 
production of proteins. 

Derivatives of the plasmid pSV2-dhfr (ATCC Accession No. 37146), the 
expression vectors pC4 (ATCC Accession No. 209646) and pC6 (ATCC Accession 
5 No.209647) contain the strong promoter (LTR) of the Rous Sarcoma Virus (Cullen et 
al., Molecular and Cellular Biology, 438-447 (March, 1985)) plus a fragment of the 
CMV-enhancer (Boshart et al., Cell 41:521-530 (1985).) Multiple cloning sites, e.g., 
with the restriction enzyme cleavage sites BamHI, Xbal and Asp718, facilitate the 
cloning of the gene of interest. The vectors also contain the 3' intron, the 

10 polyadenylation and termination signal of the rat preproinsulin gene, and the mouse 
DHFR gene under control of the SV40 early promoter. 

Specifically, the plasmid pC6, for example, is digested with appropriate 
restriction enzymes and then dephosphorylated using calf intestinal phosphates by 
procedures known in the art. The vector is then isolated from a 1% agarose gel. 

15 A polynucleotide of the present invention is amplified according to the 

protocol outlined in Example 1. If the naturally occurring signal sequence is used to 
produce the secreted protein, the vector does not need a second signal peptide. 
Alternatively, if the naturally occurring signal sequence is not used, the vector can be 
modified to include a heterologous signal sequence. (See, e.g., WO 96/34891.) 

20 The amplified fragment is isolated from a 1% agarose gel using a 

commercially available kit ("Geneclean," BIO 101 Inc., La Jolla, Ca.). The fragment 
then is digested with appropriate restriction enzymes and again purified on a 1 % 
agarose gel. 

The amplified fragment is then digested with the same restriction enzyme and 
25 purified on a 1% agarose gel. The isolated fragment and the dephosphorylated vector 
are then ligated with T4 DNA ligase. E. coli HB101 or XL-1 Blue cells are then 
transformed and bacteria are identified that contain the fragment inserted into plasmid 
pC6 using, for instance, restriction enzyme analysis. 

Chinese hamster ovary cells lacking an active DHFR gene is used for 
30 transfection. Five |lg of the expression plasmid pC6 is cotransfected with 0.5 |lg of 
the plasmid pSVneo using lipofectin (Feigner et al., supra). The plasmid pSV2-neo 
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contains a dominant selectable marker, the neo gene from Tn5 encoding an enzyme 
that confers resistance to a group of antibiotics including G418. The cells are seeded 
in alpha minus MEM supplemented with 1 mg/ml G418. After 2 days, the cells are 
trypsinized and seeded in hybridoma cloning plates (Greiner, Germany) in alpha 
5 minus MEM supplemented with 10, 25, or 50 ng/ml of metothrexate plus 1 mg/ml 
G418. After about 10-14 days single clones are trypsinized and then seeded in 6-well 
petri dishes or 10 ml flasks using different concentrations of methotrexate (50 nM, 
100 nM, 200 nM, 400 nM, 800 nM). Clones growing at the highest concentrations of 
methotrexate are then transferred to new 6-well plates containing even higher 
10 concentrations of methotrexate (1 uM, 2 uM, 5 uM, 10 mM, 20 mM). The same 

procedure is repeated until clones are obtained which grow at a concentration of 100 - 
200 uM. Expression of the desired gene product is analyzed, for instance, by SDS- 
PAGE and Western blot or by reversed phase HPLC analysis. 

15 Example 9 : Protein Fusions 

The polypeptides of the present invention are preferably fused to other 
proteins. These fusion proteins can be used for a variety of applications. For 
example, fusion of the present polypeptides to His-tag, HA-tag, protein A, IgG 
domains, and maltose binding protein facilitates purification. (See Example 5; see 

20 also EP A 394,827; Traunecker, et al., Nature 331:84-86 (1988).) Similarly, fusion to 
IgG-1, IgG-3, and albumin increases the halfiife time in vivo. Nuclear localization 
signals fused to the polypeptides of the present invention can target the protein to a 
specific subcellular localization, while covalent heterodimer or homodimers can 
increase or decrease the activity of a fusion protein. Fusion proteins can also create 

25 chimeric molecules having more than one function. Finally, fusion proteins can 
increase solubility and/or stability of the fused protein compared to the non-fused 
protein. All of the types of fusion proteins described above can be made by 
modifying the following protocol, which outlines the fusion of a polypeptide to an 
IgG molecule, or the protocol described in Example 5. 

30 Briefly, the human Fc portion of the IgG molecule can be PCR amplified, 

using primers that span the 5' and 3' ends of the sequence described below. These 
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primers also should have convenient restriction enzyme sites that will facilitate 
cloning into an expression vector, preferably a mammalian expression vector. 

For example, if pC4 (Accession No. 209646) is used, the human Fc portion 
can be ligated into the BamHI cloning site. Note that the 3' BamHI site should be 
5 destroyed. Next, the vector containing the human Fc portion is re-restricted with 

BamHI, linearizing the vector, and a polynucleotide of the present invention, isolated 
by the PCR protocol described in Example 1, is ligated into this BamHI site. Note 
that the polynucleotide is cloned without a stop codon, otherwise a fusion protein will 
not be produced. 

10 If the naturally occurring signal sequence is used to produce the secreted 

protein, pC4 does not need a second signal peptide. Alternatively, if the naturally 
occurring signal sequence is not used, the vector can be modified to include a 
heterologous signal sequence. (See, e.g., WO 96/34891.) 

15 Human IgG Fc region: 

GGGATCCGGAGCCCAAATCTTCTGACAAAACTCACACATGCCCACCGTGC 
CCAGCACCTGAATTCGAGGGTGCACCGTCAGTCTTCCTCTTCCCCCCAAAA 
CCCAAGGACACCCTCATGATCTCCCGGACTCCTGAGGTCACATGCGTGGT 
GGTGGACGTAAGCCACGAAGACCCTGAGGTCAAGTTCAACTGGTACGTGG 

20 ACGGCGTGGAGGTGCATAATGCCAAGACAAAGCCGCGGGAGGAGCAGTA 
CAACAGCACGTACCGTGTGGTCAGCGTCCTCACCGTCCTGCACCAGGACT 
GGCTGAATGGCAAGGAGTACAAGTGCAAGGTCTCCAACAAAGCCCTCCCA 
ACCCCCATCGAGAAAACCATCTCCAAAGCCAAAGGGCAGCCCCGAGAAC 
CACAGGTGTACACCCTGCCCCCATCCCGGGATGAGCTGACCAAGAACCAG 

25 GTCAGCCTGACCTGCCTGGTCAAAGGCTTCTATCCAAGCGACATCGCCGT 
GGAGTGGGAGAGCAATGGGCAGCCGGAGAACAACTACAAGACCACGCCT 
CCCGTGCTGGACTCCGACGGCTCCTTCTTCCTCTACAGCAAGCTCACCGTG 
GACAAGAGCAGGTGGCAGCAGGGGAACGTCTTCTCATGCTCCGTGATGCA 
TGAGGCTCTGCACAACCACTACACGCAGAAGAGCCTCTCCCTGTCTCCGG 

30 GTAA ATG AGTGCGACGGCCGCGACTCTAGAGGAT (SEQ ID NO: 1 ) 
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Example 10: Production of an Antibody from a Polypeptide 

The antibodies of the present invention can be prepared by a variety of 
methods. (See, Current Protocols, Chapter 2.) For example, cells expressing a 
polypeptide of the present invention is administered to an animal to induce the 
production of sera containing polyclonal antibodies. In a preferred method, a 
preparation of the secreted protein is prepared and purified to render it substantially 
free of natural contaminants. Such a preparation is then introduced into an animal in 
order to produce polyclonal antisera of greater specific activity. 

In the most preferred method, the antibodies of the present invention are 
monoclonal antibodies (or protein binding fragments thereof). Such monoclonal 
antibodies can be prepared using hybridoma technology. (Kohler et aL, Nature 
256:495 (1975); Kohler et al., Eur. J. Immunol. 6:51 1 (1976); Kohler et al., Eur. J. 
Immunol. 6:292 (1976); Hammerling et al., in: Monoclonal Antibodies and T-Cell 
Hybridomas, Elsevier, N.Y., pp. 563-681 (1981).) In general, such procedures 
involve immunizing an animal (preferably a mouse) with polypeptide or, more 
preferably, with a secreted polypeptide-expressing cell. Such cells may be cultured in 
any suitable tissue culture medium; however, it is preferable to culture cells in Earle's 
modified Eagle's medium supplemented with 10% fetal bovine serum (inactivated at 
about 56°C), and supplemented with about 10 g/1 of nonessential amino acids, about 
1,000 U/ml of penicillin, and about 100 |lg/ml of streptomycin. 

The splenocytes of such mice are extracted and fused with a suitable myeloma 
cell line. Any suitable myeloma cell line may be employed in accordance with the 
present invention; however, it is preferable to employ the parent myeloma cell line 
(SP20), available from the ATCC. After fusion, the resulting hybridoma cells are 
selectively maintained in HAT medium, and then cloned by limiting dilution as 
described by Wands et al. (Gastroenterology 80:225-232 (1981).) The hybridoma 
cells obtained through such a selection are then assayed to identify clones which 
secrete antibodies capable of binding the polypeptide. 

Alternatively, additional antibodies capable of binding to the polypeptide can 
be produced in a two-step procedure using anti-idiotypic antibodies. Such a method 
makes use of the fact that antibodies are themselves antigens, and therefore, it is 
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possible to obtain an antibody which binds to a second antibody. In accordance with 
this method, protein specific antibodies are used to immunize an animal, preferably a 
mouse. The splenocytes of such an animal are then used to produce hybridoma cells, 
and the hybridoma cells are screened to identify clones which produce an antibody 
5 whose ability to bind to the protein-specific antibody can be blocked by the 
polypeptide. Such antibodies comprise anti-idiotypic antibodies to the protein- 
specific antibody and can be used to immunize an animal to induce formation of 
further protein-specific antibodies. 

It will be appreciated that Fab and F(ab')2 and other fragments of the 

10 antibodies of the present invention may be used according to the methods disclosed 
herein. Such fragments are typically produced by proteolytic cleavage, using 
enzymes such as papain (to produce Fab fragments) or pepsin (to produce F(ab')2 
fragments). Alternatively, secreted protein-binding fragments can be produced 
through the application of recombinant DNA technology or through synthetic 

15 chemistry. 

For in vivo use of antibodies in humans, it may be preferable to use 
"humanized" chimeric monoclonal antibodies. Such antibodies can be produced 
using genetic constructs derived from hybridoma cells producing the monoclonal 
antibodies described above. Methods for producing chimeric antibodies are known in 
20 the art. (See, for review, Morrison, Science 229:1202 (1985); Oi et aL, 

BioTechniques 4:214 (1986); Cabilly et aL, U.S. Patent No. 4,816,567; Taniguchi et 
aL, EP 171496; Morrison et aL, EP 173494; Neuberger et aL, WO 8601533; Robinson 
et aL, WO 8702671; Boulianne et aL, Nature 312:643 (1984); Neuberger et aL, Nature 
314:268(1985).) 

25 

Example 11: Production Of Secreted Protein For High-Throughput Screening 
Assays 

The following protocol produces a supernatant containing a polypeptide to be 
tested. This supernatant can then be used in the Screening Assays described in 
30 Examples 13-20. 
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First, dilute Poly-D-Lysine (644 587 Boehringer-Mannheim) stock solution 
(Img/ml in PBS) 1:20 in PBS (w/o calcium or magnesium 17-5 16F Biowhittaker) for 
a working solution of 50ug/mL Add 200 ul of this solution to each well (24 well 
plates) and incubate at RT for 20 minutes. Be sure to distribute the solution over each 
5 well (note: a 12-channel pipetter may be used with tips on every other channel). 

Aspirate off the Poly-D-Lysine solution and rinse with 1ml PBS (Phosphate Buffered 
Saline). The PBS should remain in the well until just prior to plating the cells and 
plates may be poly-lysine coated in advance for up to two weeks. 

Plate 293T cells (do not carry cells past P+20) at 2 x 10 5 cells/well in .5ml 
10 DMEM(Dulbecco's Modified Eagle Medium)(with 4.5 G/L glucose and L-glutamine 
(12-604F Biowhittaker))/ 10% heat inactivated FBS(14-503F Biowhittaker)/lx 
Penstrep(17-602E Biowhittaker). Let the cells grow overnight. 

The next day, mix together in a sterile solution basin: 300 ul Lipofectamine 
(18324-012 Gibco/BRL) and 5ml Optimem I (31985070 Gibco/BRL)/96-well plate. 
15 With a small volume multi-channel pipetter, aliquot approximately 2ug of an 
expression vector containing a polynucleotide insert, produced by the methods 
described in Examples 8 or 9, into an appropriately labeled 96-well round bottom 
plate. With a multi-channel pipetter, add 50ul of the Lipofectamine/Optimem I 
mixture to each well. Pipette up and down gently to mix. Incubate at RT 15-45 
20 minutes. After about 20 minutes, use a multi-channel pipetter to add 150ul Optimem 
I to each well. As a control, one plate of vector DNA lacking an insert should be 
transfected with each set of transfections. 

Preferably, the transfection should be performed by tag-teaming the following 
tasks. By tag-teaming, hands on time is cut in half, and the cells do not spend too 
25 much time on PBS. First, person A aspirates off the media from four 24-well plates 
of cells, and then person B rinses each well with .5- lml PBS. Person A then aspirates 
off PBS rinse, and person B, using al2-channel pipetter with tips on every other 
channel, adds the 200ul of DNA/Lipofectamine/Optimem I complex to the odd wells 
first, then to the even wells, to each row on the 24-well plates. Incubate at 37°C for 6 
30 hours. 
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While cells are incubating, prepare appropriate media, either 1%BSA in 
DMEM with lx penstrep, or CHO-5 media (1 16.6 mg/L of CaC12 (anhyd); 0.00130 
mg/L CuS0 4 -5H 2 0; 0.050 mg/L of Fe(N0 3 ) 3 -9H 2 0; 0.417 mg/L of FeS0 4 -7H 2 0; 
3 1 1.80 mg/L of Kcl; 28.64 mg/L of MgCl 2 ; 48.84 mg/L of MgS0 4 ; 6995.50 mg/L of 
5 NaCl; 2400.0 mg/L of NaHCO,; 62.50 mg/L of NaH 2 PO 4 -H 2 0; 7 1 .02 mg/L of 

Na 2 HP04; .4320 mg/L of ZnS0 4 -7H 2 0; .002 mg/L of Arachidonic Acid ; 1 .022 mg/L 
of Cholesterol; .070 mg/L of DL-alpha-Tocopherol- Acetate; 0.0520 mg/L of Linoleic 
Acid; 0.010 mg/L of Linolenic Acid; 0.010 mg/L of Myristic Acid; 0.010 mg/L of 
Oleic Acid; 0.010 mg/L of Palmitric Acid; 0.010 mg/L of Palmitic Acid; 100 mg/L of 

10 Pluronic F-68; 0.010 mg/L of Stearic Acid; 2.20 mg/L of Tween 80; 4551 mg/L of D- 
Glucose; 130.85 mg/ml of L- Alanine; 147.50 mg/ml of L-Arginine-HCL; 7.50 mg/ml 
of L- Asparagine-H 2 0; 6.65 mg/ml of L-Aspartic Acid; 29.56 mg/ml of L-Cystine- 
2HCL-H 2 0; 31.29 mg/ml of L-Cystine-2HCL; 7.35 mg/ml of L-Glutamic Acid; 365.0 
mg/ml of L-Glutamine; 18.75 mg/ml of Glycine; 52.48 mg/ml of L-Histidine-HCL- 

15 H 2 0; 106.97 mg/ml of L-Isoleucine; 1 1 1.45 mg/ml of L-Leucine; 163.75 mg/ml of L- 
Lysine HCL; 32.34 mg/ml of L-Methionine; 68.48 mg/ml of L-Phenylalainine; 40.0 
mg/ml of L-Proline; 26.25 mg/ml of L-Serine; 101.05 mg/ml of L-Threonine; 19.22 
mg/ml of L-Tryptophan; 91.79 mg/ml of L-Tryrosine-2Na-2H 2 0; 99.65 mg/ml of L- 
Valine; 0.0035 mg/L of Biotin; 3.24 mg/L of D-Ca Pantothenate; 1 1.78 mg/L of 

20 Choline Chloride; 4.65 mg/L of Folic Acid; 15.60 mg/L of i-Inositol; 3.02 mg/L of 
Niacinamide; 3.00 mg/L of Pyridoxal HCL; 0.031 mg/L of Pyridoxine HCL; 0.319 
mg/L of Riboflavin; 3.17 mg/L of Thiamine HCL; 0.365 mg/L of Thymidine; and 
0.680 mg/L of Vitamin B 12 ; 25 mM of HEPES Buffer; 2.39 mg/L of Na 
Hypoxanthine; 0.105 mg/L of Lipoic Acid; 0.081 mg/L of Sodium Putrescine-2HCL; 

25 55.0 mg/L of Sodium Pyruvate; 0.0067 mg/L of Sodium Selenite; 20uM of 

Ethanolamine; 0.122 mg/L of Ferric Citrate; 41.70 mg/L of Methyl-B-Cyclodextrin 
complexed with Linoleic Acid; 33.33 mg/L of Methyl-B-Cyclodextrin complexed 
with Oleic Acid; and 10 mg/L of Methyl-B-Cyclodextrin complexed with Retinal) 
with 2mm glutamine and lx penstrep. (BSA (81-068-3 Bayer) lOOgm dissolved in 1L 

30 DMEM for a 10% BSA stock solution). Filter the media and collect 50 ul for 
endotoxin assay in 15ml polystyrene conical. 
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The transfection reaction is terminated, preferably by tag-teaming, at the end 
of the incubation period. Person A aspirates off the transfection media, while person 
B adds 1.5ml appropriate media to each well. Incubate at 37°C for 45 or 72 hours 
depending on the media used: 1%BSA for 45 hours or CHO-5 for 72 hours. 
5 On day four, using a 300ul multichannel pipetter, aliquot 600ul in one 1ml 

deep well plate and the remaining supernatant into a 2ml deep well. The supernatants 
from each well can then be used in the assays described in Examples 13-20. 

It is specifically understood that when activity is obtained in any of the assays 
described below using a supernatant, the activity originates from either the 
10 polypeptide directly (e.g., as a secreted protein) or by the polypeptide inducing 

expression of other proteins, which are then secreted into the supernatant. Thus, the 
invention further provides a method of identifying the protein in the supernatant 
characterized by an activity in a particular assay. 



15 Example 12: Constru ction of GAS Reporter Construct 

One signal transduction pathway involved in the differentiation and 
proliferation of cells is called the Jaks-STATs pathway. Activated proteins in the 
Jaks-STATs pathway bind to gamma activation site "GAS" elements or interferon- 
sensitive responsive element ("ISRE"), located in the promoter of many genes. The 

20 binding of a protein to these elements alter the expression of the associated gene. 

GAS and ISRE elements are recognized by a class of transcription factors 
called Signal Transducers and Activators of Transcription, or "STATs." There are six 
members of the STATs family. Statl and Stat3 are present in many cell types, as is 
Stat2 (as response to IFN-alpha is widespread). Stat4 is more restricted and is not in 

25 many cell types though it has been found in T helper class I, cells after treatment with 
IL-12. StatS was originally called mammary growth factor, but has been found at 
higher concentrations in other cells including myeloid cells. It can be activated in 
tissue culture cells by many cytokines. 

The STATs are activated to translocate from the cytoplasm to the nucleus 

30 upon tyrosine phosphorylation by a set of kinases known as the Janus Kinase ("Jaks") 
family. Jaks represent a distinct family of soluble tyrosine kinases and include Tyk2, 
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Jakl, Jak2, and Jak3. These kinases display significant sequence similarity and are 
generally catalytically inactive in resting cells. 

The Jaks are activated by a wide range of receptors summarized in the Table 
below. (Adapted from review by Schidler and Darnell, Ann. Rev. Biochem. 64:621- 
5 51 (1995).) A cytokine receptor family, capable of activating Jaks, is divided into two 
groups: (a) Class 1 includes receptors for IL-2, IL-3, IL-4, IL-6, IL-7, IL-9, IL-1 1, IL- 
12, IL-15, Epo, PRL, GH, G-CSF, GM-CSF, LIF, CNTF, and thrombopoietin; and (b) 
Class 2 includes IFN-a, IFN-g, and IL-10. The Class 1 receptors share a conserved 
cysteine motif (a set of four conserved cysteines and one tryptophan) and a WSXWS 
10 motif (a membrane proximal region encoding Trp-Ser-Xxx-Trp-Ser (SEQ ID NO:2)). 

Thus, on binding of a ligand to a receptor, Jaks are activated, which in turn 
activate STATs, which then translocate and bind to GAS elements. This entire 
process is encompassed in the Jaks-STATs signal transduction pathway. 

Therefore, activation of the Jaks-STATs pathway, reflected by the binding of 
15 the GAS or the IS RE element, can be used to indicate proteins involved in the 

proliferation and differentiation of cells. For example, growth factors and cytokines 
are known to activate the Jaks-STATs pathway. (See Table below.) Thus, by using 
GAS elements linked to reporter molecules, activators of the Jaks-STATs pathway 
can be identified. 
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Ligand 

IFN family 
IFN-a/B 
IFN-g 
11-10 

gp!30 family 

IL-6 (Pleiotrophic) 

Il-ll(Pleiotrophic) 

OnM(Pleiotrophic) 

LIF(Pleiotrophic) 

CNTF(Pleiotrophic) 

G-CSF(Pleiotrophic) 

IL-12(Pleiotrophic) 



tvk2 



+ 

7 

? 
7 

-/+ 

9 



g-C family 
IL-2 (lymphocytes) 
IL-4 (lymph/myeloid) - 
IL-7 (lymphocytes) 
IL-9 (lymphocytes) 
IL-13 (lymphocyte) 
iL-15 ? 

gp!40 family 
IL-3 (myeloid) 
IL-5 (myeloid) 
GM-CSF (myeloid) 

Growth hormone family 
GH ? 
PRL ? 
EPO ? 

Receptor Tyrosine Kinases 
EGF ? 
PDGF ? 
CSF-1 ? 



JAKs 
Jakl 



+ 
+ 

9 



Jak2 Jak3 



STATS GASfelements^ or ISRE 



12,3 
1 

1,3 



ISRE 

GAS (IRFl>Lys6>IFP) 



+ 


+ 


? 


13 


GAS (IRFl>Lys6>IFP) 


t 






1,3 




+ 


+ 


9 


1,3 




+ 


+ 


7 


13 




+ 


+ 


o 

f 


13 




+ 


7 


? 


13 






+ 


+ 


13 




+ 




+ 


l,3p 


GAS 


+ 


- 




6 


GAS (IRF1 = IFP »Ly6)(IgH) 


+ 




+ 


5 


GAS 


+ 




+ 


5 


GAS 


i 


9 


7 


o 


UAiS 


+ 


7 


+ 


5 


GAS 




+ 




5 


GAS (IRFl>IFP»Ly6) 




+ 




5 


GAS 




+ 




5 


GAS 




-f 




5 




+/- 


+ 




1,3 ,5 






+ 




5 


GAS(B-CAS>IRF1 =IFP»Ly6) 




+ 




1,3 


GAS (IRF1) 


+ 


+ 




1,3 




+ 


+ 




1,3 


GAS (not IRF1) 



SUBSTITUTE SHEET (RULE 26) 
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To construct a synthetic GAS containing promoter element, which is used in 
the Biological Assays described in Examples 13-14, a PCR based strategy is 
employed to generate a GAS-SV40 promoter sequence. The 5' primer contains four 
tandem copies of the GAS binding site found in the IRF1 promoter and previously 
5 demonstrated to bind STATs upon induction with a range of cytokines (Rothman et 
al., Immunity 1:457-468 (1994).), although other GAS or ISRE elements can be used 
instead. The 5' primer also contains 18bp of sequence complementary to the SV40 
early promoter sequence and is flanked with an Xhol site. The sequence of the 5' 
primer is: 

10 5 * : GCGCCTCG AG ATTTCCCCG A A ATCT AG ATTTCCCCG A A ATG ATTTCCCC 
GAAATGATTTCCCCGAAATATCTGCCATCTCAATTAG:3' (SEQ ID NO:3) 

The downstream primer is complementary to the SV40 promoter and is 
flanked with a Hind III site: 5 ' :GCGGCAAGCTTTTTGC AAAGCCTAGGC: 3 9 
(SEQ ID NO:4) 

15 PCR amplification is performed using the SV40 promoter template present in 

the B-gal:promoter plasmid obtained from Clontech. The resulting PCR fragment is 
digested with Xhol/Hind III and subcloned into BLSK2-. (Stratagene.) Sequencing 
with forward and reverse primers confirms that the insert contains the following 
sequence: 

20 5 ' : CTCGAG ATTTCCCCG A A ATCT AG ATTTCCCCG A A ATG ATTTCCCCG A A A 
TGATTTCCCCGAAATATCTGCCATCTCAATTAGTCAGCAACCATAGTCCCG 
CCCCTAACTCCGCCCATCCCGCCCCTAACTCCGCCCAGTTCCGCCCATTCT 



TCGGCCTCTGAGCTATTCCAGAAGTAGTGAGGAGGCTTTTTTGGAGGCCT 

25 AGGCTTTTGC AA AAAGCTT :3 * (SEQIDNO:5) 

With this GAS promoter element linked to the SV40 promoter, a GAS:SEAP2 
reporter construct is next engineered. Here, the reporter molecule is a secreted 
alkaline phosphatase, or "SEAP " Clearly, however, any reporter molecule can be 
instead of SEAP, in this or in any of the other Examples. Well known reporter 

30 molecules that can be used instead of SEAP include chloramphenicol 
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acetyltransferase (CAT), luciferase, alkaline phosphatase, B-galactosidase, green 
fluorescent protein (GFP), or any protein detectable by an antibody. 

The above sequence confirmed synthetic GAS-S V40 promoter element is 
subcloned into the pSEAP-Promoter vector obtained from Clontech using Hindlll and 
5 Xhol, effectively replacing the SV40 promoter with the amplified GAS:SV40 

promoter element, to create the GAS-SEAP vector. However, this vector does not 
contain a neomycin resistance gene, and therefore, is not preferred for mammalian 
expression systems. 

Thus, in order to generate mammalian stable cell lines expressing the GAS- 
10 SEAP reporter, the GAS-SEAP cassette is removed from the GAS-SEAP vector using 
Sail and NotI, and inserted into a backbone vector containing the neomycin resistance 
gene, such as pGFP-1 (Clontech), using these restriction sites in the multiple cloning 
site, to create the GAS-SEAP/Neo vector. Once this vector is transfected into 
mammalian cells, this vector can then be used as a reporter molecule for GAS binding 
15 as described in Examples 13-14. 

Other constructs can be made using the above description and replacing GAS 
with a different promoter sequence. For example, construction of reporter molecules 
containing NFK-B and EGR promoter sequences are described in Examples 15 and 
16. However, many other promoters can be substituted using the protocols described 
20 in these Examples. For instance, SRE, JL-2, NFAT, or Osteocalcin promoters can be 
substituted, alone or in combination (e.g., GAS/NF-KB/EGR, GAS/NF-KB, U- 
2/NFAT, or NF-KB/GAS). Similarly, other cell lines can be used to test reporter 
construct activity, such as HELA (epithelial), HUVEC (endothelial), Reh (B-cell), 
Saos-2 (osteoblast), HUVAC (aortic), or Cardiomyocyte. 

25 

Example 13; High-Throughput Screeni ng Assay fnr T-cell Activity. 

The following protocol is used to assess T-cell activity by identifying factors, 
such as growth factors and cytokines, that may proliferate or differentiate T-cells. T- 
cell activity is assessed using the GAS/SEAP/Neo construct produced in Example 12. 
30 Thus, factors that increase SEAP activity indicate the ability to activate the Jaks- 
STATS signal transduction pathway. The T-cell used in this assay is Jurkat T-cells 
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(ATCC Accession No. TIB- 152), although Molt-3 cells (ATCC Accession No. CRL- 
1552) and Molt-4 cells (ATCC Accession No. CRL-1582) cells can also be used. 

Jurkat T-cells are lymphoblastic CD4+ Thl helper cells. In order to generate 
stable cell lines, approximately 2 million Jurkat cells are transfected with the GAS- 
5 SEAP/neo vector using DMRIE-C (Life Technologies)(transfection procedure 
described below). The transfected cells are seeded to a density of approximately 
20,000 cells per well and transfectants resistant to 1 mg/ml genticin selected. 
Resistant colonies are expanded and then tested for their response to increasing 
concentrations of interferon gamma. The dose response of a selected clone is 
10 demonstrated. 

Specifically, the following protocol will yield sufficient cells for 75 wells 
containing 200 ul of cells. Thus, it is either scaled up, or performed in multiple to 
generate sufficient cells for multiple 96 well plates. Jurkat cells are maintained in 
RPMI + 10% serum with l%Pen-Strep. Combine 2.5 mis of OPTI-MEM (Life 
15 Technologies) with 10 ug of plasmid DNA in a T25 flask. Add 2.5 ml OPTI-MEM 
containing 50 ul of DMRIE-C and incubate at room temperature for 15-45 mins. 

During the incubation period, count cell concentration, spin down the required 
number of cells (10 7 per transfection), and resuspend in OPTI-MEM to a final 
concentration of 10 7 cells/ml. Then add 1ml of 1 x 10 7 cells in OPTI-MEM to T25 
20 flask and incubate at 37°C for 6 hrs. After the incubation, add 10 ml of RPMI + 15% 
serum. 

The Jurkat:GAS-SEAP stable reporter lines are maintained in RPMI + 10% 
serum, 1 mg/ml Genticin, and 1% Pen-Strep. These cells are treated with 
supernatants containing a polypeptide as produced by the protocol described in 
25 Example 1 1 . 

On the day of treatment with the supernatant, the cells should be washed and 
resuspended in fresh RPMI + 10% serum to a density of 500,000 cells per ml. The 
exact number of cells required will depend on the number of supernatants being 
screened. For one 96 well plate, approximately 10 million cells (for 10 plates, 100 
30 million cells) are required. 
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Transfer the cells to a triangular reservoir boat, in order to dispense the cells 
into a 96 well dish, using a 12 channel pipette. Using a 12 channel pipette, transfer 
200 ul of cells into each well (therefore adding 100, 000 cells per well). 

After all the plates have been seeded, 50 ul of the supernatants are transferred 
directly from the 96 well plate containing the supernatants into each well using a 12 
channel pipette. In addition, a dose of exogenous interferon gamma (0.1, 1.0, 10 ng) 
is added to wells H9, H10, and HI 1 to serve as additional positive controls for the 
assay. 

The 96 well dishes containing Jurkat cells treated with supernatants are placed 
in an incubator for 48 hrs (note: this time is variable between 48-72 hrs). 35 ul 
samples from each well are then transferred to an opaque 96 well plate using a 12 
channel pipette. The opaque plates should be covered (using sellophene covers) and 
stored at -20°C until SEAP assays are performed according to Example 17. The 
plates containing the remaining treated cells are placed at 4°C and serve as a source 
of material for repeating the assay on a specific well if desired. 

As a positive control, 100 Unit/ml interferon gamma can be used which is 
known to activate Jurkat T cells. Over 30 fold induction is typically observed in the 
positive control wells. 

The above protocol may be used in the generation of both transient, as well as, 
stable transfected cells, which would be apparent to those of skill in the art. 

Example 14: High-Throughput Screening Assay Identifying Myeloid Activity 

The following protocol is used to assess myeloid activity by identifying 
factors, such as growth factors and cytokines, that may proliferate or differentiate 
myeloid cells. Myeloid cell activity is assessed using the GAS/SEAP/Neo construct 
produced in Example 12. Thus, factors that increase SEAP activity indicate the 
ability to activate the Jaks-STATS signal transduction pathway. The myeloid cell 
used in this assay is U937, a pre-monocyte cell line, although TF-1, HL60, or KG1 
can be used. 

To transiently transfect U937 cells with the GAS/SEAP/Neo construct 
produced in Example 12, a DEAE-Dextran method (Kharbanda et. ah, 1994, Cell 
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Growth & Differentiation, 5:259-265) is used. First, harvest 2x1 Oe 7 U937 cells and 
wash with PBS. The U937 cells are usually grown in RPMI 1640 medium containing 
10% heat-inactivated fetal bovine serum (FBS) supplemented with 100 units/ml 
penicillin and 100 mg/ml streptomycin. 
5 Next, suspend the cells in 1 ml of 20 mM Tris-HCl (pH 7.4) buffer containing 

0.5 mg/ml DEAE-Dextran, 8 ug GAS-SEAP2 plasmid DNA, 140 mM NaCl, 5 mM 

KC1, 375 uM Na 2 HP0 4 .7H 2 0, 1 mM MgCl 2 , and 675 uM CaCl 2 . Incubate at 37°C 
for 45 min. 

Wash the cells with RPMI 1640 medium containing 10% FBS and then 

10 resuspend in 10 ml complete medium and incubate at 37°C for 36 hr. 

The GAS-SEAP/U937 stable cells are obtained by growing the cells in 400 
ug/ml G418. The G418-free medium is used for routine growth but every one to two 
months, the cells should be re-grown in 400 ug/ml G418 for couple of passages. 

These cells are tested by harvesting 1x10 cells (this is enough for ten 96- well 
15 plates assay) and wash with PBS. Suspend the cells in 200 ml above described 

growth medium, with a final density of 5xl0 5 cells/ml. Plate 200 ul cells per well in 
the 96- well plate (or 1x10 s cells/well). 

Add 50 ul of the supernatant prepared by the protocol described in Example 

11. Incubate at 37°C for 48 to 72 hr. As a positive control, 100 Unit/ml interferon 
20 gamma can be used which is known to activate U937 cells. Over 30 fold induction is 
typically observed in the positive control wells. SEAP assay the supernatant 
according to the protocol described in Example 17. 

Example 15: High-Throughput Screening Assay Identifying Neuronal Activity, 

25 When cells undergo differentiation and proliferation, a group of genes are 

activated through many different signal transduction pathways. One of these genes, 
EGR1 (early growth response gene 1), is induced in various tissues and cell types 
upon activation. The promoter of EGR1 is responsible for such induction. Using the 
EGR1 promoter linked to reporter molecules, activation of cells can be assessed. 
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Particularly, the following protocol is used to assess neuronal activity in PC 12 
cell lines. PC 12 cells (rat phenochromocytoma cells) are known to proliferate and/or 
differentiate by activation with a number of mitogens, such as TPA (tetradecanoyl 
phorbol acetate), NGF (nerve growth factor), and EGF (epidermal growth factor). 
5 The EGR1 gene expression is activated during this treatment. Thus, by stably 

transfecting PC 12 cells with a construct containing an EGR promoter linked to SEAP 
reporter, activation of PC 12 cells can be assessed. 

The EGR/SEAP reporter construct can be assembled by the following 
protocol. The EGR-1 promoter sequence (-633 to +l)(Sakamoto K et ah, Oncogene 
10 6:867-87 1 (1991)) can be PCR amplified from human genomic DNA using the 
following primers: 

5' GCGCTCGAGGGATGACAGCGATAGAACCCCGG -3' (SEQ ID NO:6) 
5' GCGAAGCTTCGCGACTCCCCGGATCCGCCTC-3' (SEQ ID NO:7) 
Using the GAS:SEAP/Neo vector produced in Example 12, EGR1 amplified 
15 product can then be inserted into this vector. Linearize the GAS:SEAP/Neo vector 
using restriction enzymes Xhol/Hindlll, removing the GAS/S V40 stuffer. Restrict the 
EGR1 amplified product with these same enzymes. Ligate the vector and the EGR1 
promoter. 

To prepare 96 well-plates for cell culture, two mis of a coating solution (1 :30 
20 dilution of collagen type I (Upstate Biotech Inc. Cat#08-1 15) in 30% ethanol (filter 
sterilized)) is added per one 10 cm plate or 50 ml per well of the 96- well plate, and 
allowed to air dry for 2 hr. 

PC 12 cells are routinely grown in RPMI-1640 medium (Bio Whittaker) 
containing 10% horse serum (JRH BIOSCIENCES, Cat. # 12449-78P), 5% heat- 
25 inactivated fetal bovine serum (FBS) supplemented with 100 units/ml penicillin and 
100 ug/ml streptomycin on a precoated 10 cm tissue culture dish. One to four split is 
done every three to four days. Cells are removed from the plates by scraping and 
resuspended with pipetting up and down for more than 15 times. 

Transfect the EGR/SEAP/Neo construct into PC 12 using the Lipofectamine 
30 protocol described in Example 1 1 . EGR-SEAP/PC 1 2 stable cells are obtained by 
growing the cells in 300 ug/ml G418. The G418-free medium is used for routine 
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growth but every one to two months, the cells should be re-grown in 300 ug/ml G418 
for couple of passages. 

To assay for neuronal activity, a 10 cm plate with cells around 70 to 80% 
confluent is screened by removing the old medium. Wash the cells once with PBS 
5 (Phosphate buffered saline). Then starve the cells in low serum medium (RPMI-1640 
containing 1% horse serum and 0.5% FBS with antibiotics) overnight. 

The next morning, remove the medium and wash the cells with PBS. Scrape 
off the cells from the plate, suspend the cells well in 2 ml low serum medium. Count 

the cell number and add more low serum medium to reach final cell density as 5xl0 5 
10 cells/ml. 

Add 200 ul of the cell suspension to each well of 96-well plate (equivalent to 
lxlO 5 cells/well). Add 50 ul supernatant produced by Example 11, 37°C for 48 to 72 
hr. As a positive control, a growth factor known to activate PC 12 cells through EGR 
can be used, such as 50 ng/ul of Neuronal Growth Factor (NGF). Over fifty- fold 
15 induction of SEAP is typically seen in the positive control wells. SEAP assay the 
supernatant according to Example 17. 

Example 16: High -Throughput Screening Assay for T-cell Activity 

NF-kB (Nuclear Factor kB) is a transcription factor activated by a wide 
20 variety of agents including the inflammatory cytokines IL-1 and TNF, CD30 and 
CD40, lymphotoxin-alpha and lymphotoxin-beta, by exposure to LPS or thrombin, 
and by expression of certain viral gene products. As a transcription factor, NF-kB 
regulates the expression of genes involved in immune cell activation, control of 
apoptosis (NF- kB appears to shield cells from apoptosis), B and T-cell development, 
25 anti-viral and antimicrobial responses, and multiple stress responses. 

In non-stimulated conditions, NF- kB is retained in the cytoplasm with I-kB 
(Inhibitor kB). However, upon stimulation, I- kB is phosphorylated and degraded, 
causing NF- kB to shuttle to the nucleus, thereby activating transcription of target 
genes. Target genes activated by NF- kB include IL-2, IL-6, GM-CSF, ICAM-1 and 
30 class 1 MHC. 
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Due to its central role and ability to respond to a range of stimuli, reporter 
constructs utilizing the NF-kB promoter element are used to screen the supernatants 
produced in Example 1 1 . Activators or inhibitors of NF-kB would be useful in 
treating diseases. For example, inhibitors of NF-kB could be used to treat those 
5 diseases related to the acute or chronic activation of NF-kB, such as rheumatoid 
arthritis. 

To construct a vector containing the NF-kB promoter element, a PCR based 
strategy is employed. The upstream primer contains four tandem copies of the NF-kB 
binding site (GGGGACTTTCCC) (SEQ ID NO:8), 18 bp of sequence complementary 
10 to the 5' end of the SV40 early promoter sequence, and is flanked with an Xhol site: 
5 ' :GCGGCCTCGAGGGGACTTTCCCGGGGACTTTCCGGGGACTTTCCGGGAC 
TTTCC ATCCTGCC ATCTC A ATT AG : 3 * (SEQ ID NO:9) 

The downstream primer is complementary to the 3' end of the SV40 promoter 
and is flanked with a Hind III site: 
15 5^GCGGCAAGCTTTTTGCAAAGCCTAGGC:3' (SEQ ID NO:4) 

PCR amplification is performed using the SV40 promoter template present in 
the pB-gal:promoter plasmid obtained from Clontech. The resulting PCR fragment is 
digested with Xhol and Hind III and subcloned into BLSK2-. (Stratagene) 
Sequencing with the T7 and T3 primers confirms the insert contains the following 
20 sequence: 

5 ' :CTCG AGGGGACTTTCCCGGGGACTTTCCGGGGACTTTCCGGGACTTTCC 
ATCTGCCATCTCAATTAGTCAGCAACCATAGTCCCGCCCCTAACTCCGCCC 
ATCCCGCCCCTAACTCCGCCCAGTTCCGCCCATTCTCCGCCCCATGGCTGA 
25 CTAAi I i l l l 1 1ATTTATGCAGAGGCCGAGGCCGCCTCGGCCTCTGAGCTA 
TTCCAGAAGTAGTGAGGAGGCTTTTTTGGAGGCCTA 
GCTT:3' (SEQ ID NO: 10) 

Next, replace the S V40 minimal promoter element present in the pSEAP2- 
30 promoter plasmid (Clontech) with this NF-KB/SV40 fragment using Xhol and 
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Hindlll. However, this vector does not contain a neomycin resistance gene, and 
therefore, is not preferred for mammalian expression systems. 

In order to generate stable mammalian cell lines, the NF-KB/SV40/SEAP 
cassette is removed from the above NF-kB/SEAP vector using restriction enzymes 
5 Sail and NotI, and inserted into a vector containing neomycin resistance. Particularly, 
the NF-KB/SV40/SEAP cassette was inserted into pGFP-1 (Clontech), replacing the 
GFP gene, after restricting pGFP-1 with Sail and NotL 

Once NF-KB/SV40/SEAP/Neo vector is created, stable Jurkat T-cells are 
created and maintained according to the protocol described in Example 13. Similarly, 
10 the method for assaying supernatants with these stable Jurkat T-cells is also described 
in Example 13. As a positive control, exogenous TNF alpha (0.1,1, 10 ng) is added to 
wells H9, H10, and HI 1, with a 5-10 fold activation typically observed. 

Exam ple 17: Assay for SEAP Activity 

15 As a reporter molecule for the assays described in Examples 13-16, SEAP 

activity is assayed using the Tropix Phospho-light Kit (Cat. BP-400) according to the 
following general procedure. The Tropix Phospho-light Kit supplies the Dilution, 
Assay, and Reaction Buffers used below. 

Prime a dispenser with the 2.5x Dilution Buffer and dispense 15 \i\ of 2.5x 

20 dilution buffer into Optiplates containing 35 |Lll of a supernatant. Seal the plates with 
a plastic sealer and incubate at 65°C for 30 min. Separate the Optiplates to avoid 
uneven heating. 

Cool the samples to room temperature for 15 minutes. Empty the dispenser 
and prime with the Assay Buffer. Add 50 jil Assay Buffer and incubate at room 

25 temperature 5 min. Empty the dispenser and prime with the Reaction Buffer (see the 
table below). Add 50 |Xl Reaction Buffer and incubate at room temperature for 20 
minutes. Since the intensity of the chemiluminescent signal is time dependent, and it 
takes about 10 minutes to read 5 plates on luminometer, one should treat 5 plates at 
each time and start the second set 10 minutes later. 

30 Read the relative light unit in the luminometer. Set H12 as blank, and print 

the results. An increase in chemiluminescence indicates reporter activity. 
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Reaction Buffer Formulation: 

#of plates Rxn buffer diluent (ml) CSPD (ml) 



10 


60 


3 


11 


65 


3.25 


12 


70 


3.5 


13 


75 


3.75 


14 


80 


4 


15 


85 


4.25 


16 


90 


4.5 


17 


95 


4.75 


18 


100 


5 


19 


105 


5.25 


20 


110 


5.5 


21 


115 


5.75 


22 


120 


6 


23 


125 


6.25 


24 


130 


6.5 


25 


135 


6.75 


26 


140 


7 


27 


145 


7.25 


28 


150 


7.5 


29 


155 


7.75 


30 


160 


8 


31 


165 


8.25 


32 


170 


8.5 


33 


175 


8.75 


34 


180 


9 


35 


185 


9.25 


36 


190 


9.5 


37 


195 


9.75 


38 


200 


10 


39 


205 


10.25 


40 


210 


10.5 


41 


215 


10.75 


42 


220 


11 


43 


225 


11.25 


44 


230 


11.5 


45 


235 


11.75 


46 


240 


12 


47 


245 


12.25 


48 


250 


12.5 


49 


255 


12.75 


50 


260 


13 



Example 18: High-Throughput Screening Assay Identifying Changes in Small 
5 Molecule Concentration and Membrane Permeability 

Binding of a ligand to a receptor is known to alter intracellular levels of small 
molecules, such as calcium, potassium, sodium, and pH, as well as alter membrane 
potential. These alterations can be measured in an assay to identify supernatants 
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which bind to receptors of a particular cell. Although the following protocol 
describes an assay for calcium, this protocol can easily be modified to detect changes 
in potassium, sodium, pH, membrane potential, or any other small molecule which is 
detectable by a fluorescent probe. 

5 The following assay uses Fluorometric Imaging Plate Reader ("FLIPR") to 

measure changes in fluorescent molecules (Molecular Probes) that bind small 
molecules. Clearly, any fluorescent molecule detecting a small molecule can be used 
instead of the calcium fluorescent molecule, fluo-4 (ltolecular Probes, Inc.; 
catalog no. F-14202) , used here. 

10 For adherent cells, seed the cells at 10,000 -20,000 cells/well in a Co-star 

black 96-well plate with clear bottom. The plate is incubated in a C0 2 incubator for 
20 hours. The adherent cells are washed two times in Biotek washer with 200 ul of 
HBSS (Hank's Balanced Salt Solution) leaving 100 ul of buffer after the final wash. 
A stock solution of 1 mg/ml fluo-4 is made in 10% pluronic acid DMSO. To 

15 load the cells with fluo-4 , 50 ul of 12 ug/ml fluo-4 is added to each well. The plate 
is incubated at 37°C in a C0 2 incubator for 60 min. The plate is washed four times in 
the Biotek washer with HBSS leaving 100 ul of buffer. 

For non-adherent cells, the cells are spun down from culture media. Cells are 
re-suspended to 2-5xl0 6 cells/ml with HBSS in a 50-ml conical tube. 4 ul of 1 mg/ml 

20 fluo-4 solution in 10% pluronic acid DMSO is added to each ml of cell suspension. 
The tube is then placed in a 37°C water bath for 30-60 min. The cells are washed 
twice with HBSS, resuspended to IxlO 6 cells/ml, and dispensed into a microplate, 100 
ul/well. The plate is centrifuged at 1000 rpm for 5 min. The plate is then washed 
once in Denley CellWash with 200 ul, followed by an aspiration step to 100 ul final 

25 volume. 

For a non-cell based assay, each well contains a fluorescent molecule, such as 
fluo-4 . The supernatant is added to the well, and a change in fluorescence is 
detected. 

To measure the fluorescence of intracellular calcium, the FLIPR is set for the 
30 following parameters: (1) System gain is 300-800 mW; (2) Exposure time is 0.4 
second; (3) Camera F/stop is F/2; (4) Excitation is 488 nm; (5) Emission is 530 nm; 
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and (6) Sample addition is 50 ul. Increased emission at 530 nm indicates an 
extracellular signaling event which has resulted in an increase in the intracellular 

concentration. 

5 Example 19: High-Throughput Screening Assay Identifying Tyrosine Kinase 
Activity 

The Protein Tyrosine Kinases (PTK) represent a diverse group of 
transmembrane and cytoplasmic kinases. Within the Receptor Protein Tyrosine 
Kinase RPTK) group are receptors for a range of mitogenic and metabolic growth 

10 factors including the PDGF, FGF, EGF, NGF, HGF and Insulin receptor subfamilies. 
In addition there are a large family of RPTKs for which the corresponding ligand is 
unknown. Ligands for RPTKs include mainly secreted small proteins, but also 
membrane-bound and extracellular matrix proteins. 

Activation of RPTK by ligands involves ligand-mediated receptor 

15 dimerization, resulting in transphosphorylation of the receptor subunits and activation 
of the cytoplasmic tyrosine kinases. The cytoplasmic tyrosine kinases include 
receptor associated tyrosine kinases of the src-family (e.g., src, yes, lck, lyn, fyn) and 
non-receptor linked and cytosolic protein tyrosine kinases, such as the Jak family, 
members of which mediate signal transduction triggered by the cytokine superfamily 

20 of receptors (e.g., the Interleukins, Interferons, GM-CSF, and Leptin). 

Because of the wide range of known factors capable of stimulating tyrosine 
kinase activity, the identification of novel human secreted proteins capable of 
activating tyrosine kinase signal transduction pathways are of interest. Therefore, the 
following protocol is designed to identify those novel human secreted proteins 

25 capable of activating the tyrosine kinase signal transduction pathways. 

Seed target cells (e.g., primary keratinocytes) at a density of approximately 
25,000 cells per well in a 96 well Loprodyne Silent Screen Plates purchased from 
Nalge Nunc (Naperville, IL). The plates are sterilized with two 30 minute rinses with 
100% ethanol, rinsed with water and dried overnight. Some plates are coated for 2 hr 

30 with 100 ml of cell culture grade type I collagen (50 mg/ml), gelatin (2%) or 

polylysine (50 mg/ml), all of which can be purchased from Sigma Chemicals (St. 
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Louis, MO) or 10% Matrigel purchased from Becton Dickinson (Bedford,MA), or 

calf serum, rinsed with PBS and stored at 4°C. Cell growth on these plates is assayed 
by seeding 5,000 cells/well in growth medium and indirect quantitation of cell 
number through use of alamarBlue as described by the manufacturer Alamar 
5 Biosciences, Inc. (Sacramento, CA) after 48 hr. Falcon plate covers #3071 from 
Becton Dickinson (Bedford,MA) are used to cover the Loprodyne Silent Screen 
Plates. Falcon Microtest III cell culture plates can also be used in some proliferation 
experiments. 

To prepare extracts, A431 cells are seeded onto the nylon membranes of 
10 Loprodyne plates (20,000/200ml/well) and cultured overnight in complete medium. 
Cells are quiesced by incubation in serum-free basal medium for 24 hr. After 5-20 
minutes treatment with EGF (60ng/ml) or 50 ul of the supernatant produced in 
Example 11, the medium was removed and 100 ml of extraction buffer ((20 mM 
HEPES pH 7.5, 0.15 M NaCl, 1% Triton X-100, 0.1% SDS, 2 mM Na3V04, 2 mM 
15 Na4P207 and a cocktail of protease inhibitors (# 1836170) obtained from 

Boeheringer Mannheim (Indianapolis, IN) is added to each well and the plate is 

shaken on a rotating shaker for 5 minutes at 4°C. The plate is then placed in a 
vacuum transfer manifold and the extract filtered through the 0.45 mm membrane 
bottoms of each well using house vacuum. Extracts are collected in a 96- well 
20 catch/assay plate in the bottom of the vacuum manifold and immediately placed on 
ice. To obtain extracts clarified by centrifugation, the content of each well, after 
detergent solubilization for 5 minutes, is removed and centrifuged for 15 minutes at 

4°C at 16,000 xg. 

Test the filtered extracts for levels of tyrosine kinase activity. Although many 
25 methods of detecting tyrosine kinase activity are known, one method is described 
here. 

Generally, the tyrosine kinase activity of a supernatant is evaluated by 
determining its ability to phosphorylate a tyrosine residue on a specific substrate (a 
biotinylated peptide). Biotinylated peptides that can be used for this purpose include 
30 PSK1 (corresponding to amino acids 6-20 of the cell division kinase cdc2-p34) and 
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PSK2 (corresponding to amino acids 1-17 of gastrin). Both peptides are substrates for 
a range of tyrosine kinases and are available from Boehringer Mannheim. 

The tyrosine kinase reaction is set up by adding the following components in 
order. First, add lOul of 5uM Biotinylated Peptide, then lOul ATP/Mg2+ (5mM 
5 ATP/50mM MgCl2), then lOul of 5x Assay Buffer (40mM imidazole hydrochloride, 
pH7.3, 40 mM beta-glycerophosphate, ImM EGTA, lOOmM MgCl2, 5 mM MnC^, 
0.5 mg/ml BSA), then 5ul of Sodium Vanadate(lmM), and then 5ul of water. Mix the 

components gently and preincubate the reaction mix at 30°C for 2 min. Initial the 
reaction by adding lOul of the control enzyme or the filtered supernatant. 
10 The tyrosine kinase assay reaction is then terminated by adding 10 ul of 

120mm EDTA and place the reactions on ice. 

Tyrosine kinase activity is determined by transferring 50 ul aliquot of reaction 

mixture to a microtiter plate (MTP) module and incubating at 37°C for 20 min. This 
allows the streptavadin coated 96 well plate to associate with the biotinylated peptide. 
15 Wash the MTP module with 300ul/well of PBS four times. Next add 75 ul of anti- 
phospotyrosine antibody conjugated to horse radish peroxidase(anti-P-Tyr- 

POD(0.5u/ml)) to each well and incubate at 37°C for one hour. Wash the well as 
above. 

Next add lOOul of peroxidase substrate solution (Boehringer Mannheim) and 
20 incubate at room temperature for at least 5 mins (up to 30 min). Measure the 

absorbance of the sample at 405 nm by using ELIS A reader. The level of bound 
peroxidase activity is quantitated using an ELIS A reader and reflects the level of 
tyrosine kinase activity. 

25 Example 20: High-Throughput Screening Assay Identifying Phosphorylation 
Activity 

As a potential alternative and/or compliment to the assay of protein tyrosine 
kinase activity described in Example 19, an assay which detects activation 
(phosphorylation) of major intracellular signal transduction intermediates can also be 
30 used. For example, as described below one particular assay can detect tyrosine 
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phosphorylation of the Erk-1 and Erk-2 kinases. However, phosphorylation of other 
molecules, such as Raf, JNK, p38 MAP, Map kinase kinase (MEK), MEK kinase, 
Src, Muscle specific kinase (MuSK), IRAK, Tec, and Janus, as well as any other 
phosphoserine, phosphotyrosine, or phosphothreonine molecule, can be detected by 
substituting these molecules for Erk^l or Erk-2 in the following assay. 

Specifically, assay plates are made by coating the wells of a 96- well ELISA 
plate with 0.1ml of protein G (lug/ml) for 2 hr at room temp, (RT). The plates are 
then rinsed with PBS and blocked with 3% BS A/PBS for 1 hr at RT. The protein G 
plates are then treated with 2 commercial monoclonal antibodies (lOOng/well) against 
Erk-1 

and Erk-2 (1 hr at RT) (Santa Cruz Biotechnology). (To detect other molecules, this 
step can easily be modified by substituting a monoclonal antibody detecting any of 
the above described molecules.) After 3-5 rinses with PBS, the plates are stored at 
4°C until use. 

A431 cells are seeded at 20,000/well in a 96- well Loprodyne filterplate and 
cultured overnight in growth medium. The cells are then starved for 48 hr in basal 
medium (DMEM) and then treated with EGF (6ng/well) or 50 ul of the supernatants 
obtained in Example 1 1 for 5-20 minutes. The cells are then solubilized and extracts 
filtered directly into the assay plate. 

After incubation with the extract for 1 hr at RT, the wells are again rinsed. As 
a positive control, a commercial preparation of MAP kinase (lOng/well) is used in 
place 

of A431 extract. Plates are then treated with a commercial polyclonal (rabbit) 
antibody (lug/ml) which specifically recognizes the phosphorylated epitope of the 
Erk-1 and Erk-2 kinases (1 hr at RT). This antibody is biotinylated by standard 
procedures. The bound polyclonal antibody is then quantitated by successive 
incubations with Europium-streptavidin and Europium fluorescence enhancing 
reagent in the Wallac DELFIA instrument (time-resolved fluorescence). An increased 
fluorescent signal over background indicates a phosphorylation. 
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Example 21: Method of Determining Alterations in a Gene Correspon ding to a 
Polynucleotide 

RNA isolated from entire families or individual patients presenting with a 
phenotype of interest (such as a disease) is be isolated. cDNA is then generated from 
5 these RNA samples using protocols known in the art. (See, Sambrook.) The cDNA 
is then used as a template for PCR, employing primers surrounding regions of interest 
in SEQ ID NO:X. Suggested PCR conditions consist of 35 cycles at 95°C for 30 
seconds; 60-120 seconds at 52-58°C; and 60-120 seconds at 70°C, using buffer 
solutions described in Sidransky, D., et al., Science 252:706 (1991). 

1 0 PCR products are then sequenced using primers labeled at their 5* end with T4 

polynucleotide kinase, employing SequiTherm Polymerase. (Epicentre 
Technologies). The intron-exon borders of selected exons is also determined and 
genomic PCR products analyzed to confirm the results. PCR products harboring 
suspected mutations is then cloned and sequenced to validate the results of the direct 

15 sequencing. 

PCR products is cloned into T-tailed vectors as described in Holton, T.A. and 
Graham, M.W., Nucleic Acids Research, 19: 1 156 (1991) and sequenced with T7 
polymerase (United States Biochemical). Affected individuals are identified by 
mutations not present in unaffected individuals. 

20 Genomic rearrangements are also observed as a method of determining 

alterations in a gene corresponding to a polynucleotide. Genomic clones isolated 
according to Example 2 are nick-translated with digoxigenindeoxy-uridine 5'- 
triphosphate (Boehringer Manheim), and FISH performed as described in Johnson, 
Cg. et al., Methods Cell Biol. 35:73-99 (1991). Hybridization with the labeled probe 

25 is carried out using a vast excess of human cot-1 DNA for specific hybridization to 
the corresponding genomic locus. 

Chromosomes are counterstained with 4,6-diamino-2-phenylidole and 
propidium iodide, producing a combination of C- and R-bands. Aligned images for 
precise mapping are obtained using a triple-band filter set (Chroma Technology, 

30 Brattleboro, VT) in combination with a cooled charge-coupled device camera 

(Photometries, Tucson, AZ) and variable excitation wavelength filters. (Johnson, Cv. 
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et al., Genet. Anal. Tech. Appl., 8:75 (1991).) Image collection, analysis and 
chromosomal fractional length measurements are performed using the ISee Graphical 
Program System. (Inovision Corporation, Durham, NC.) Chromosome alterations of 
the genomic region hybridized by the probe are identified as insertions, deletions, and 
translocations. These alterations are used as a diagnostic marker for an associated 
disease. 

Example 22: Meth od of Detecting Abnormal Levels of a Polypeptide in a 
Biological Sample 

A polypeptide of the present invention can be detected in a biological sample, 
and if an increased or decreased level of the polypeptide is detected, this polypeptide 
is a marker for a particular phenotype. Methods of detection are numerous, and thus, 
it is understood that one skilled in the art can modify the following assay to fit their 
particular needs. 

For example, antibody-sandwich ELISAs are used to detect polypeptides in a 
sample, preferably a biological sample. Wells of a microtiter plate are coated with 
specific antibodies, at a final concentration of 0.2 to 10 ug/ml. The antibodies are 
either monoclonal or polyclonal and are produced by the method described in 
Example 10. The wells are blocked so that non-specific binding of the polypeptide to 
the well is reduced. 

The coated wells are then incubated for > 2 hours at RT with a sample 
containing the polypeptide. Preferably, serial dilutions of the sample should be used 
to validate results. The plates are then washed three times with deionized or distilled 
water to remove unbounded polypeptide. 

Next, 50 ul of specific antibody-alkaline phosphatase conjugate, at a 
concentration of 25-400 ng, is added and incubated for 2 hours at room temperature. 
The plates are again washed three times with deionized or distilled water to remove 
unbounded conjugate. 

Add 75 ul of 4-methylumbelliferyl phosphate (MUP) or p-nitrophenyl 
phosphate (NPP) substrate solution to each well and incubate 1 hour at room 
temperature. Measure the reaction by a microtiter plate reader. Prepare a standard 
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curve, using serial dilutions of a control sample, and plot polypeptide concentration 
on the X-axis (log scale) and fluorescence or absorbance of the Y-axis (linear scale). 
Interpolate the concentration of the polypeptide in the sample using the standard 
curve. 

5 

Example 23: Formulating a Polypeptide 

The secreted polypeptide composition will be formulated and dosed in a 
fashion consistent with good medical practice, taking into account the clinical 
condition of the individual patient (especially the side effects of treatment with the 

10 secreted polypeptide alone), the site of delivery, the method of administration, the 

scheduling of administration, and other factors known to practitioners. The "effective 
amount" for purposes herein is thus determined by such considerations. 

As a general proposition, the total pharmaceutical^ effective amount of 
secreted polypeptide administered parenterally per dose will be in the range of about 1 

15 p.g/kg/day to 10 mg/kg/day of patient body weight, although, as noted above, this will 
be subject to therapeutic discretion. More preferably, this dose is at least 0.01 
mg/kg/day, and most preferably for humans between about 0.01 and 1 mg/kg/day for 
the hormone. If given continuously, the secreted polypeptide is typically 
administered at a dose rate of about 1 (ig/kg/hour to about 50 jxg/kg/hour, either by 1- 

20 4 injections per day or by continuous subcutaneous infusions, for example, using a 
mini-pump. An intravenous bag solution may also be employed. The length of 
treatment needed to observe changes and the interval following treatment for 
responses to occur appears to vary depending on the desired effect. 

Pharmaceutical compositions containing the secreted protein of the invention 

25 are administered orally, rectally, parenterally, intracistemally, intravaginally, 

intraperitoneally, topically (as by powders, ointments, gels, drops or transdermal 
patch), bucally, or as an oral or nasal spray. "Pharmaceutically acceptable carrier" 
refers to a non-toxic solid, semisolid or liquid filler, diluent, encapsulating material or 
formulation auxiliary of any type. The term "parenteral" as used herein refers to 

30 modes of administration which include intravenous, intramuscular, intraperitoneal, 
intrasternal, subcutaneous and intraarticular injection and infusion. 
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The secreted polypeptide is also suitably administered by sustained-release 
systems. Suitable examples of sustained-release compositions include semi- 
permeable polymer matrices in the form of shaped articles, e.g., films, or 
mirocapsules. Sustained-release matrices include polylactides (U.S. Pat. No. 
5 3,773,919, EP 58,481), copolymers of L-glutamic acid and gamma-ethyl-L-glutamate 
(Sidman, U. et aL, Biopolymers 22:547-556 (1983)), poly (2- hydroxyethyl 
methacrylate) (R. Langer et aL, J. Biomed. Mater. Res. 15:167-277 (1981), and R. 
Langer, Chem. Tech. 12:98-105 (1982)), ethylene vinyl acetate (R. Langer et aL) or 
poly-D- (-)-3-hydroxybutyric acid (EP 133,988). Sustained-release compositions 

10 also include liposomally entrapped polypeptides. Liposomes containing the secreted 
polypeptide are prepared by methods known per se: DE 3,218,121; Epstein et aL, 
Proc. Natl. Acad. Sci. USA 82:3688-3692 (1985); Hwang et aL, Proc. Natl. Acad. Sci. 
USA 77:4030-4034 (1980); EP 52,322; EP 36,676; EP 88,046; EP 143,949; EP 
142,641 ; Japanese Pat. Appl. 83-1 1 8008; U.S. Pat. Nos. 4,485,045 and 4,544,545; and 

15 EP 102,324. Ordinarily, the liposomes are of the small (about 200-800 Angstroms) 
unilamellar type in which the lipid content is greater than about 30 mol. percent 
cholesterol, the selected proportion being adjusted for the optimal secreted 
polypeptide therapy. 

For parenteral administration, in one embodiment, the secreted polypeptide is 

20 formulated generally by mixing it at the desired degree of purity, in a unit dosage 
injectable form (solution, suspension, or emulsion), with a pharmaceutically 
acceptable carrier, i.e., one that is non-toxic to recipients at the dosages and 
concentrations employed and is compatible with other ingredients of the formulation. 
For example, the formulation preferably does not include oxidizing agents and other 

25 compounds that are known to be deleterious to polypeptides. 

Generally, the formulations are prepared by contacting the polypeptide 
uniformly and intimately with liquid carriers or finely divided solid carriers or both. 
Then, if necessary, the product is shaped into the desired formulation. Preferably the 
carrier is a parenteral carrier, more preferably a solution that is isotonic with the blood 

30 of the recipient. Examples of such carrier vehicles include water, saline, Ringer's 
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solution, and dextrose solution. Non-aqueous vehicles such as fixed oils and ethyl 
oleate are also useful herein, as well as liposomes. 

The carrier suitably contains minor amounts of additives such as substances 
that enhance isotonicity and chemical stability. Such materials are non-toxic to 
5 recipients at the dosages and concentrations employed, and include buffers such as 
phosphate, citrate, succinate, acetic acid, and other organic acids or their salts; 
antioxidants such as ascorbic acid; low molecular weight (less than about ten 
residues) polypeptides, e.g., polyarginine or tripeptides; proteins, such as serum 
albumin, gelatin, or immunoglobulins; hydrophilic polymers such as 

10 polyvinylpyrrolidone; amino acids, such as glycine, glutamic acid, aspartic acid, or 

arginine; monosaccharides, disaccharides, and other carbohydrates including cellulose 
or its derivatives, glucose, manose, or dextrins; chelating agents such as EDTA; sugar 
alcohols such as mannitol or sorbitol; counterions such as sodium; and/or nonionic 
surfactants such as polysorbates, poloxamers, or PEG. 

15 The secreted polypeptide is typically formulated in such vehicles at a 

concentration of about 0.1 mg/ml to 100 mg/ml, preferably 1-10 mg/ml, at a pH of 
about 3 to 8. It will be understood that the use of certain of the foregoing excipients, 
carriers, or stabilizers will result in the formation of polypeptide salts. 

Any polypeptide to be used for therapeutic administration can be sterile. 

20 Sterility is readily accomplished by filtration through sterile filtration membranes 
(e.g., 0.2 micron membranes). Therapeutic polypeptide compositions generally are 
placed into a container having a sterile access port, for example, an intravenous 
solution bag or vial having a stopper pierceable by a hypodermic injection needle. 
Polypeptides ordinarily will be stored in unit or multi-dose containers, for 

25 example, sealed ampoules or vials, as an aqueous solution or as a lyophilized 

formulation for reconstitution. As an example of a lyophilized formulation, 10-ml 
vials are filled with 5 ml of sterile-filtered 1% (w/v) aqueous polypeptide solution, 
and the resulting mixture is lyophilized. The infusion solution is prepared by 
reconstituting the lyophilized polypeptide using bacteriostatic Water-for-Injection. 

30 The invention also provides a pharmaceutical pack or kit comprising one or 

more containers filled with one or more of the ingredients of the pharmaceutical 
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compositions of the invention. Associated with such container(s) can be a notice in 
the form prescribed by a governmental agency regulating the manufacture, use or sale 
of pharmaceuticals or biological products, which notice reflects approval by the 
agency of manufacture, use or sale for human administration. In addition, the 
5 polypeptides of the present invention may be employed in conjunction with other 
therapeutic compounds. 

Example 24: Method of Treating Decreased Levels of the Polypeptide 

It will be appreciated that conditions caused by a decrease in the standard or 
10 normal expression level of a secreted protein in an individual can be treated by 
administering the polypeptide of the present invention, preferably in the secreted 
form. Thus, the invention also provides a method of treatment of an individual in 
need of an increased level of the polypeptide comprising administering to such an 
individual a pharmaceutical composition comprising an amount of the polypeptide to 
15 increase the activity level of the polypeptide in such an individual. 

For example, a patient with decreased levels of a polypeptide receives a daily 
dose 0.1-100 ug/kg of the polypeptide for six consecutive days. Preferably, the 
polypeptide is in the secreted form. The exact details of the dosing scheme, based on 
administration and formulation, are provided in Example 23. 

20 

Example 25: Method of Treating Increased Levels of the Polypeptide 

Antisense technology is used to inhibit production of a polypeptide of the 
present invention. This technology is one example of a method of decreasing levels 
of a polypeptide, preferably a secreted form, due to a variety of etiologies, such as 
25 cancer. 

For example, a patient diagnosed with abnormally increased levels of a 
polypeptide is administered intravenously antisense polynucleotides at 0.5, 1.0, 1.5, 
2.0 and 3.0 mg/kg day for 21 days. This treatment is repeated after a 7-day rest 
period if the treatment was well tolerated. The formulation of the antisense 
30 polynucleotide is provided in Example 23. 
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Example 26: Method of Treatment Using Gene Therapy 

One method of gene therapy transplants fibroblasts, which are capable of 
expressing a polypeptide, onto a patient. Generally, fibroblasts are obtained from a 
subject by skin biopsy. The resulting tissue is placed in tissue-culture medium and 
separated into small pieces. Small chunks of the tissue are placed on a wet surface of 
a tissue culture flask, approximately ten pieces are placed in each flask. The flask is 
turned upside down, closed tight and left at room temperature over night. After 24 
hours at room temperature, the flask is inverted and the chunks of tissue remain fixed 
to the bottom of the flask and fresh media (e.g., Ham's F12 media, with 10% FBS, 
penicillin and streptomycin) is added. The flasks are then incubated at 37°C for 
approximately one week. 

At this time, fresh media is added and subsequently changed every several 
days. After an additional two weeks in culture, a monolayer of fibroblasts emerge. 
The monolayer is trypsinized and scaled into larger flasks. 

pMV-7 (Kirschmeier, P.T. et aL, DNA, 7:219-25 (1988)), flanked by the long 
terminal repeats of the Moloney murine sarcoma virus, is digested with EcoRI and 
Hindlll and subsequently treated with calf intestinal phosphatase. The linear vector is 
fractionated on agarose gel and purified, using glass beads. 

The cDNA encoding a polypeptide of the present invention can be amplified 
using PCR primers which correspond to the 5' and 3' end sequences respectively as set 
forth in Example 1. Preferably, the 5' primer contains an EcoRI site and the 3* primer 
includes a Hindlll site. Equal quantities of the Moloney murine sarcoma virus linear 
backbone and the amplified EcoRI and Hindlll fragment are added together, in the 
presence of T4 DNA ligase. The resulting mixture is maintained under conditions 
appropriate for ligation of the two fragments. The ligation mixture is then used to 
transform bacteria HB101, which are then plated onto agar containing kanamycin for 
the purpose of confirming that the vector has the gene of interest properly inserted. 

The amphotropic pA317 or GP+aml2 packaging cells are grown in tissue 
culture to confluent density in Dulbecco's Modified Eagles Medium (DMEM) with 
10% calf serum (CS), penicillin and streptomycin. The MSV vector containing the 
gene is then added to the media and the packaging cells transduced with the vector. 
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The packaging cells now produce infectious viral particles containing the gene (the 
packaging cells are now referred to as producer cells). 

Fresh media is added to the transduced producer cells, and subsequently, the 
media is harvested from a 10 cm plate of confluent producer cells. The spent media, 
containing the infectious viral particles, is filtered through a millipore filter to remove 
detached producer cells and this media is then used to infect fibroblast cells. Media is 
removed from a sub-confluent plate of fibroblasts and quickly replaced with the 
media from the producer cells. This media is removed and replaced with fresh media. 
If the titer of virus is high, then virtually all fibroblasts will be infected and no 
selection is required. If the titer is very low, then it is necessary to use a retroviral 
vector that has a selectable marker, such as neo or his. Once the fibroblasts have been 
efficiently infected, the fibroblasts are analyzed to determine whether protein is 
produced. 

The engineered fibroblasts are then transplanted onto the host, either alone or 
after having been grown to confluence on cytodex 3 microcarrier beads. 



Example 27: Method of Treatment Using Gene Therapy - In Vivo 

Another aspect of the present invention is using in vivo gene therapy methods 
to treat disorders, diseases and conditions. The gene therapy method relates to the 
introduction of naked nucleic acid (DNA, RNA, and antisense DNA or RNA) 
sequences into an animal to increase or decrease the expression of the polypeptide. 
The polynucleotide of the present invention may be operatively linked to a promoter 
or any other genetic elements necessary for the expression of the polypeptide by the 
target tissue. Such gene therapy and delivery techniques and methods are known in 
the art, see, for example, WO90/11092, W098/11779; U.S. Patent NO. 5693622, 
5705151, 5580859; Tabata H. et al. (1997) Cardiovasc. Res. 35(3):470-479, Chao J et 
al. (1997) Pharmacol. Res. 35(6):5 17-522, Wolff J.A. (1997) Neuromuscul. Disord. 
7(5):314-318, Schwartz B. et al. (1996) Gene Ther. 3(5):405-41 1, Tsurumi Y. et al. 
(1996) Circulation 94(12):328 1-3290 (incorporated herein by reference). 
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The polynucleotide constructs may be delivered by any method that delivers 
injectable materials to the cells of an animal, such as, injection into the interstitial 
space of tissues (heart, muscle, skin, lung, liver, intestine and the like). The 
polynucleotide constructs can be delivered in a pharmaceutically acceptable liquid or 
aqueous carrier. 

The term "naked" polynucleotide, DNA or RNA, refers to sequences that are 
free from any delivery vehicle that acts to assist, promote, or facilitate entry into the 
cell, including viral sequences, viral particles, liposome formulations, lipofectin or 
precipitating agents and the like. However, the polynucleotides of the present 
invention may also be delivered in liposome formulations (such as those taught in 
Feigner P.L. et al. (1995) Ann. NY Acad. Sci. 772:126-139 and Abdallah B. et al. 
(1995) Biol. Cell 85(1): 1-7) which can be prepared by methods well known to those 
skilled in the art. 

The polynucleotide vector constructs used in the gene therapy method are 
preferably constructs that will not integrate into the host genome nor will they contain 
sequences that allow for replication. Any strong promoter known to those skilled in 
the art can be used for driving the expression of DNA. Unlike other gene therapies 
techniques, one major advantage of introducing naked nucleic acid sequences into 
target cells is the transitory nature of the polynucleotide synthesis in the cells. Studies 
have shown that non-replicating DNA sequences can be introduced into cells to 
provide production of the desired polypeptide for periods of up to six months. 

The polynucleotide construct can be delivered to the interstitial space of 
tissues within the an animal, including of muscle, skin, brain, lung, liver, spleen, bone 
marrow, thymus, heart, lymph, blood, bone, cartilage, pancreas, kidney, gall bladder, 
stomach, intestine, testis, ovary, uterus, rectum, nervous system, eye, gland, and 
connective tissue. Interstitial space of the tissues comprises the intercellular fluid, 
mucopolysaccharide matrix among the reticular fibers of organ tissues, elastic fibers 
in the walls of vessels or chambers, collagen fibers of fibrous tissues, or that same 
matrix within connective tissue ensheathing muscle cells or in the lacunae of bone. It 
is similarly the space occupied by the plasma of the circulation and the lymph fluid of 
the lymphatic channels. Delivery to the interstitial space of muscle tissue is preferred 
for the reasons discussed below. They may be conveniently delivered by injection 
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into the tissues comprising these cells. They are preferably delivered to and 
expressed in persistent, non-dividing cells which are differentiated, although delivery 
and expression may be achieved in non-differentiated or less completely 
differentiated cells, such as, for example, stem cells of blood or skin fibroblasts. In 
5 vivo muscle cells are particularly competent in their ability to take up and express 
polynucleotides. 

For the naked polynucleotide injection, an effective dosage amount of DNA or 
RNA will be in the range of from about 0.05 g/kg body weight to about 50 mg/kg 
body weight. Preferably the dosage will be from about 0.005 mg/kg to about 20 
10 mg/kg and more preferably from about 0.05 mg/kg to about 5 mg/kg. Of course, as 
the artisan of ordinary skill will appreciate, this dosage will vary according to the 
tissue site of injection. The appropriate and effective dosage of nucleic acid sequence 
can readily be determined by those of ordinary skill in the art and may depend on the 
condition being treated and the route of administration. The preferred route of 

15 administration is by the parenteral route of injection into the interstitial space of 
tissues. However, other parenteral routes may also be used, such as, inhalation of an 
aerosol formulation particularly for delivery to lungs or bronchial tissues, throat or 
mucous membranes of the nose. In addition, naked polynucleotide constructs can be 
delivered to arteries during angioplasty by the catheter used in the procedure. 

20 The dose response effects of injected polynucleotide in muscle in vivo is 

determined as follows. Suitable template DNA for production of mRNA coding for 
polypeptide of the present invention is prepared in accordance with a standard 
recombinant DNA methodology. The template DNA, which may be either circular or 
linear, is either used as naked DNA or complexed with liposomes. The quadriceps 

25 muscles of mice are then injected with various amounts of the template DNA. 

Five to six week old female and male Balb/C mice are anesthetized by 
intraperitoneal injection with 0.3 ml of 2.5% Avertin. A 1.5 cm incision is made on 
the anterior thigh, and the quadriceps muscle is directly visualized. The template 
DNA is injected in 0.1 ml of carrier in a 1 cc syringe through a 27 gauge needle over 

30 one minute, approximately 0.5 cm from the distal insertion site of the muscle into the 
knee and about 0.2 cm deep. A suture is placed over the injection site for future 
localization, and the skin is closed with stainless steel clips. 
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After an appropriate incubation time (e.g., 7 days) muscle extracts are 
prepared by excising the entire quadriceps. Every fifth 1 5 urn cross-section of the 
individual quadriceps muscles is histochemically stained for protein expression. A 
time course for protein expression may be done in a similar fashion except that 
5 quadriceps from different mice are harvested at different times. Persistence of DNA 
in muscle following injection may be determined by Southern blot analysis after 
preparing total cellular DNA and HIRT supernatants from injected and control mice. 
The results of the above experimentation in mice can be use to extrapolate proper 
dosages and other treatment parameters in humans and other animals using naked 
10 DNA. 

Example 28: Transgenic Animals. 

The polypeptides of the invention can also be expressed in transgenic animals. 
Animals of any species, including, but not limited to, mice, rats, rabbits, hamsters, 
15 guinea pigs, pigs, micro-pigs, goats, sheep, cows and non-human primates, e.g., 
baboons, monkeys, and chimpanzees may be used to generate transgenic animals. In a 
specific embodiment, techniques described herein or otherwise known in the art, are 
used to express polypeptides of the invention in humans, as part of a gene therapy 
protocol. 

20 Any technique known in the art may be used to introduce the transgene (i.e., 

polynucleotides of the invention) into animals to produce the founder lines of 
transgenic animals. Such techniques include, but are not limited to, pronuclear 
microinjection (Paterson et al., Appl. Microbiol. BiotechnoL 40:691-698 (1994); 
Carver et al., Biotechnology (NY) 1 1:1263-1270 (1993); Wright et al., Biotechnology 

25 (NY) 9:830-834 (1991); and Hoppe et al., U.S. Pat. No. 4,873,191 (1989)); retrovirus 
mediated gene transfer into germ lines (Van der Putten et al., Proc. Natl. Acad. Sci., 
USA 82:6148-6152 (1985)), blastocysts or embryos; gene targeting in embryonic 
stem cells (Thompson et al., Cell 56:313-321 (1989)); electroporation of cells or 
embryos (Lo, 1983, Mol Cell. Biol. 3:1803-1814 (1983)); introduction of the 

30 polynucleotides of the invention using a gene gun (see, e.g., Ulmer et al., Science 
259:1745 (1993); introducing nucleic acid constructs into embryonic pleuripotent 
stem cells and transferring the stem cells back into the blastocyst; and sperm- 
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mediated gene transfer (Lavitrano et al., Cell 57:71 7-723 (1 989); etc. For a review of 
such techniques, see Gordon, "Transgenic Animals," Intl. Rev. Cytol. 115:171-229 
(1989), which is incorporated by reference herein in its entirety. 

Any technique known in the art may be used to produce transgenic clones 
5 containing polynucleotides of the invention, for example, nuclear transfer into 
enucleated oocytes of nuclei from cultured embryonic, fetal, or adult cells induced to 
quiescence (Campell et al., Nature 380:64-66 (1996); Wilmut et al., Nature 385:810- 
813 (1997)). 

The present invention provides for transgenic animals that carry the transgene 

10 in all their cells, as well as animals which carry the transgene in some, but not all their 
cells, i.e., mosaic animals or chimeric. The transgene may be integrated as a single 
transgene or as multiple copies such as in concatamers, e.g., head-to-head tandems or 
head-to-tail tandems. The transgene may also be selectively introduced into and 
activated in a particular cell type by following, for example, the teaching of Lasko et 

15 al. (Lasko et al., Proc. Natl. Acad. Sci. USA 89:6232-6236 (1992)). The regulatory 
sequences required for such a cell-type specific activation will depend upon the 
particular cell type of interest, and will be apparent to those of skill in the art. When 
it is desired that the polynucleotide transgene be integrated into the chromosomal site 
of the endogenous gene, gene targeting is preferred. Briefly, when such a technique is 

20 to be utilized, vectors containing some nucleotide sequences homologous to the 
endogenous gene are designed for the purpose of integrating, via homologous 
recombination with chromosomal sequences, into and disrupting the function of the 
nucleotide sequence of the endogenous gene. The transgene may also be selectively 
introduced into a particular cell type, thus inactivating the endogenous gene in only 

25 that cell type, by following, for example, the teaching of Gu et al. (Gu et al., Science 
265:103-106 (1994)). The regulatory sequences required for such a cell-type specific 
inactivation will depend upon the particular cell type of interest, and will be apparent 
to those of skill in the art. 

Once transgenic animals have been generated, the expression of the 

30 recombinant gene may be assayed utilizing standard techniques. Initial screening 
may be accomplished by Southern blot analysis or PCR techniques to analyze animal 
tissues to verify that integration of the transgene has taken place. The level of mRNA 
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expression of the transgene in the tissues of the transgenic animals may also be 
assessed using techniques which include, but are not limited to, Northern blot analysis 
of tissue samples obtained from the animal, in situ hybridization analysis, and reverse 
transcriptase-PCR (rt-PCR). Samples of transgenic gene-expressing tissue may also 
5 be evaluated immunocytochemically or immunohistochemically using antibodies 
specific for the transgene product. 

Once the founder animals are produced, they may be bred, inbred, outbred, or 
crossbred to produce colonies of the particular animal. Examples of such breeding 
strategies include, but are not limited to: outbreeding of founder animals with more 
10 than one integration site in order to establish separate lines; inbreeding of separate 
lines in order to produce compound transgenics that express the transgene at higher 
levels because of the effects of additive expression of each transgene; crossing of 
heterozygous transgenic animals to produce animals homozygous for a given 
integration site in order to both augment expression and eliminate the need for 
15 screening of animals by DNA analysis; crossing of separate homozygous lines to 
produce compound heterozygous or homozygous lines; and breeding to place the 
transgene on a distinct background that is appropriate for an experimental model of 
interest. 

Transgenic animals of the invention have uses which include, but are not 
20 limited to, animal model systems useful in elaborating the biological function of 
polypeptides of the present invention, studying conditions and/or disorders associated 
with aberrant expression, and in screening for compounds effective in ameliorating 
such conditions and/or disorders. 

25 Example 29: Knock-Out Animals. 

Endogenous gene expression can also be reduced by inactivating or "knocking 
out" the gene and/or its promoter using targeted homologous recombination. {E.g., 
see Smithies et al., Nature 317:230-234 (1985); Thomas & Capecchi, Cell 51:503- 
512 (1987); Thompson et al., Cell 5:313-321 (1989); each of which is incorporated by 
30 reference herein in its entirety). For example, a mutant, non-functional 
polynucleotide of the invention (or a completely unrelated DNA sequence) flanked by 
DNA homologous to the endogenous polynucleotide sequence (either the coding 
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regions or regulatory regions of the gene) can be used, with or without a selectable 
marker and/or a negative selectable marker, to transfect cells that express 
polypeptides of the invention in vivo. In another embodiment, techniques known in 
the art are used to generate knockouts in cells that contain, but do not express the gene 
5 of interest. Insertion of the DNA construct, via targeted homologous recombination, 
results in inactivation of the targeted gene. Such approaches are particularly suited in 
research and agricultural fields where modifications to embryonic stem cells can be 
used to generate animal offspring with an inactive targeted gene {e.g., see Thomas & 
Capecchi 1987 and Thompson 1989, supra). However this approach can be routinely 

10 adapted for use in humans provided the recombinant DNA constructs are directly 
administered or targeted to the required site in vivo using appropriate viral vectors that 
will be apparent to those of skill in the art. 

In further embodiments of the invention, cells that are genetically engineered 
to express the polypeptides of the invention, or alternatively, that are genetically 

15 engineered not to express the polypeptides of the invention (e.g., knockouts) are 
administered to a patient in vivo. Such cells may be obtained from the patient (i.e., 
animal, including human) or an MHC compatible donor and can include, but are not 
limited to fibroblasts, bone marrow cells, blood cells (e.g. . lymphocytes), adipocytes, 
muscle cells, endothelial cells etc. The cells are genetically engineered in vitro using 

20 recombinant DNA techniques to introduce the coding sequence of polypeptides of the 
invention into the cells, or alternatively, to disrupt the coding sequence and/or 
endogenous regulatory sequence associated with the polypeptides of the invention, 
e.g. , by transduction (using viral vectors, and preferably vectors that integrate the 
transgene into the cell genome) or transfection procedures, including, but not limited 

25 to, the use of plasmids, cosmids, YACs, naked DNA, electroporation, liposomes, etc. 
The coding sequence of the polypeptides of the invention can be placed under the 
control of a strong constitutive or inducible promoter or promoter/enhancer to achieve 
expression, and preferably secretion, of the polypeptides of the invention. The 
engineered cells which express and preferably secrete the polypeptides of the 

30 invention can be introduced into the patient systemically, e.g., in the circulation, or 
intraperitoneally. 
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Alternatively, the cells can be incorporated into a matrix and implanted in the 
body, e^, genetically engineered fibroblasts can be implanted as part of a skin graft; 
genetically engineered endothelial cells can be implanted as part of a lymphatic or 
vascular graft. (See, for example, Anderson etal. U.S. Patent No. 5,399,349; and 
5 Mulligan & Wilson, U.S. Patent No. 5,460,959 each of which is incorporated by 
reference herein in its entirety). 

When the cells to be administered are non-autologous or non-MHC 
compatible cells, they can be administered using well known techniques which 
prevent the development of a host immune response against the introduced cells. For 
10 example, the cells may be introduced in an encapsulated form which, while allowing 
for an exchange of components with the immediate extracellular environment, does 
not allow the introduced cells to be recognized by the host immune system. 

Transgenic and "knock-out" animals of the invention have uses which include, 
but are not limited to, animal model systems useful in elaborating the biological 
15 function of polypeptides of the present invention, studying conditions and/or disorders 
associated with aberrant expression, and in screening for compounds effective in 
ameliorating such conditions and/or disorders. 

It will be clear that the invention may be practiced otherwise than as 
20 particularly described in the foregoing description and examples. Numerous 

modifications and variations of the present invention are possible in light of the above 
teachings and, therefore, are within the scope of the appended claims. 

The entire disclosure of each document cited (including patents, patent 
applications, journal articles, abstracts, laboratory manuals, books, or other 
25 disclosures) in the Background of the Invention, Detailed Description, and Examples 
is hereby incorporated herein by reference. Further, the hard copy of the sequence 
listing submitted herewith and the corresponding computer readable form are both 
incorporated herein by reference in their entireties. 
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INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 

(PCT Rule 13 bis) 



A The indications made below relate to the microorganism referred to in the description 



on page 



180 



,line 



N/A 



B. IDENTIFICATION OF DEPOSIT 



Further deposits are identified on an additional sheet | | 



Name of depositary institution American Type Culture Collection 



Address of depositary institution (including postal code and country) 

10801 University Boulevard 
Manassas, Virginia 201 1 0-2209 
United States of America 



Date of deposit 



February 12, 1998 



Accession Number 



209628 



C ADDITIONAL INDICATIONS (leave blank if not applicable) This information is continued on an additional sheet [^] 



D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indications are not for all designated Slates) 



E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable) 



The indications listed below will be submitted to the International Bureau later (specify the general nature of the indications e.g., "Accession 
Number of Deposit") 



For receiving Office use only 



Thi s sheet was received with the international application 



Authorized officer 



For International Bureau use only 



| | This sheet was received by the International Bureau on: 



Authorized officer 



Form PCT/RO/134 (July 1992) 
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INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 

(PCT Rule \3bis) 



A The indications made below relate to the microorganism referred to in the description 
on page 1^3 ii ne N/A 



B. IDENTIFICATION OFDEPOSTT Further deposits are identified on an additional sheet | | 



Name of depositary institution American Type Culture Collection 



Address of depositary institution (including postal code and country) 

10801 University Boulevard 
Manassas, Virginia 201 1 0-2209 
United States of America 



Date of deposit 

February 25, 1998 


Accession Number 

209641 


C ADDITIONAL INDICATIONS ( leave blank if not applicable) This information is continued on an additional sheet | | 



D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indications are not for all designated States) 



E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable ) 



The indications listed below will be submitted to the International Bureau later (specify the general nature of the indications e.g., "Accession 
Number of Deposit") 



Forreceiving Office useonly 



This sheet was received with the international application 



Authorized officer 



For International Bureau use only 



I I This sheet was received by the International Bureau c 



Authorized officer 



Form PCT/RO/134 (July 1992) 
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INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 

(PCTRule 13 bis) 



A. The indications made below relate to the microorganism referred to in the description 
on page !?? . line N/A 



B. roETTOOFlCATIONOFDEPOSrr Further deposits are identified on an additional sheet | | 



Name of depositary institution American Type Culture Collection 



Address of depositary institution (including postal code and country) 

10801 University Boulevard 
Manassas, Virginia 201 1 0-2209 
United States of America 



Date of deposit 




Accession Number 






March 4, 1998 




209651 



C ADDITIONAL INDICATIONS (leave blank if not applicable) This information is continued on an additional sheet [^j 



D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indications are not for all designated States) 



E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable) 



The indications listed below will be submitted to the International Bureau later (specify the general nature of the indications e.g., "Accession 
Number of Deposit") 



For receiving Office use only 



| ^j ^This sheet was received with the international application 

> i. ;■ M , 



Authorized officer 



For International Bureau use only 



| [ This sheet was received by the International Bureau on: 



Authorized officer 



Form PCT/RO/134 (July 1992) 
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What Is Claimed Is: 

1. An isolated nucleic acid molecule comprising a polynucleotide having 
a nucleotide sequence at least 95% identical to a sequence selected from the group 
consisting of: 

(a) a polynucleotide fragment of SEQ ID NO:X or a polynucleotide fragment 
of the cDNA sequence included in ATCC Deposit No:Z, which is hybridizable to 
SEQ ID NO:X; 

(b) a polynucleotide encoding a polypeptide fragment of SEQ ID NO: Y or a 
polypeptide fragment encoded by the cDNA sequence included in ATCC Deposit 
No:Z, which is hybridizable to SEQ ID NO:X; 

(c) a polynucleotide encoding a polypeptide domain of SEQ ID NO: Y or a 
polypeptide domain encoded by the cDNA sequence included in ATCC Deposit 
No:Z, which is hybridizable to SEQ ID NO:X; 

(d) a polynucleotide encoding a polypeptide epitope of SEQ ID NO: Y or a 
polypeptide epitope encoded by the cDNA sequence included in ATCC Deposit 
No:Z, which is hybridizable to SEQ ID NO:X; 

(e) a polynucleotide encoding a polypeptide of SEQ ID NO:Y or the cDNA 
sequence included in ATCC Deposit No:Z, which is hybridizable to SEQ ID NO:X, 
having biological activity; 

(f) a polynucleotide which is a variant of SEQ ID NO:X; 

(g) a polynucleotide which is an allelic variant of SEQ ID NO:X; 

(h) a polynucleotide which encodes a species homologue of the SEQ ID 

NO:Y; 

(i) a polynucleotide capable of hybridizing under stringent conditions to any 
one of the polynucleotides specified in (a)-(h), wherein said polynucleotide does not 
hybridize under stringent conditions to a nucleic acid molecule having a nucleotide 
sequence of only A residues or of only T residues. 
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2. The isolated nucleic acid molecule of claim 1, wherein the 
polynucleotide fragment comprises a nucleotide sequence encoding a secreted 
protein. 

3. The isolated nucleic acid molecule of claim 1, wherein the 
polynucleotide fragment comprises a nucleotide sequence encoding the sequence 
identified as SEQ ID NO:Y or the polypeptide encoded by the cDNA sequence 
included in ATCC Deposit No:Z, which is hybridizable to SEQ ID NO:X. 

4. The isolated nucleic acid molecule of claim 1 , wherein the 
polynucleotide fragment comprises the entire nucleotide sequence of SEQ ID NO:X 
or the cDNA sequence included in ATCC Deposit No:Z, which is hybridizable to 
SEQ ID NO:X. 

5. The isolated nucleic acid molecule of claim 2, wherein the nucleotide 
sequence comprises sequential nucleotide deletions from either the C-terminus or the 
N-terminus. 

6. The isolated nucleic acid molecule of claim 3, wherein the nucleotide 
sequence comprises sequential nucleotide deletions from either the C-terminus or the 
N-terminus. 

7. A recombinant vector comprising the isolated nucleic acid molecule of 
claim L 

8. A method of making a recombinant host cell comprising the isolated 
nucleic acid molecule of claim 1 . 

9. A recombinant host cell produced by the method of claim 8. 



10. 



The recombinant host cell of claim 9 comprising vector sequences. 
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11. An isolated polypeptide comprising an amino acid sequence at least 
95% identical to a sequence selected from the group consisting of: 

(a) a polypeptide fragment of SEQ ID NO: Y or the encoded sequence 
5 included in ATCC Deposit No:Z; 

(b) a polypeptide fragment of SEQ ID NO: Y or the encoded sequence 
included in ATCC Deposit No:Z, having biological activity; 

(c) a polypeptide domain of SEQ ID NO: Y or the encoded sequence included 
in ATCC Deposit No:Z; 

10 ( d > a polypeptide epitope of SEQ ID NO: Y or the encoded sequence included 

in ATCC Deposit No:Z; 

(e) a secreted form of SEQ ID NO: Y or the encoded sequence included in 
ATCC Deposit No:Z; 

(f) a full length protein of SEQ ID NO: Y or the encoded sequence included in 
1 5 ATCC Deposit No:Z; 

(g) a variant of SEQ ID NO: Y; 

(h) an allelic variant of SEQ ID NO:Y; or 

(i) a species homologue of the SEQ ID NO:Y. 

12. The isolated polypeptide of claim 11, wherein the secreted form or the 
20 full length protein comprises sequential amino acid deletions from either the C- 
terminus or the N-terminus. 

An isolated antibody that binds specifically to the isolated polypeptide 
A recombinant host cell that expresses the isolated polypeptide of 



13. 

of claim 1 1 

25 

14. 
claim 1 1 . 



15. A method of making an isolated polypeptide comprising: 
(a) culturing the recombinant host cell of claim 14 under conditions such that 
said polypeptide is expressed; and 
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(b) recovering said polypeptide. 

16. The polypeptide produced by claim 15. 

17. A method for preventing, treating, or ameliorating a medical condition, 
comprising administering to a mammalian subject a therapeutically effective amount 
of the polypeptide of claim 1 1 or the polynucleotide of claim 1 . 

18. A method of diagnosing a pathological condition or a susceptibility to 
a pathological condition in a subject comprising: 

(a) determining the presence or absence of a mutation in the polynucleotide of 
claim 1; and 

(b) diagnosing a pathological condition or a susceptibility to a pathological 
condition based on the presence or absence of said mutation. 

19. A method of diagnosing a pathological condition or a susceptibility to 
a pathological condition in a subject comprising: 

(a) determining the presence or amount of expression of the polypeptide of 
claim 1 1 in a biological sample; and 

(b) diagnosing a pathological condition or a susceptibility to a pathological 
condition based on the presence or amount of expression of the polypeptide. 

20. A method for identifying a binding partner to the polypeptide of claim 
1 1 comprising: 

(a) contacting the polypeptide of claim 1 1 with a binding partner; and 

(b) determining whether the binding partner effects an activity of the 
polypeptide. 



21. 



The gene corresponding to the cDNA sequence of SEQ ID NO: Y. 
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22. A method of identifying an activity in a biological assay, wherein the 
method comprises: 

(a) expressing SEQ ID NO:X in a cell; 

(b) isolating the supernatant; 

5 (c) detecting an activity in a biological assay; and 

(d) identifying the protein in the supernatant having the activity. 

23. The product produced by the method of claim 20. 
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<210> 1 

<211> 733 

<212> DNA 

<213> Homo sapiens 

<400> 1 
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gggatccgga gcccaaatct tctgacaaaa ctcacacatg cccaccgtgc ccagcacctg 60 

aattcgaggg tgcaccgtca gtcttcctct tccccccaaa acccaaggac accctcatga 120 

tctcccggac tcctgaggtc acatgcgtgg tggtggacgt aagccacgaa gaccctgagg 18 0 

tcaagttcaa ctggtacgtg gacggcgtgg aggtgcataa tgccaagaca aagccgcggg 240 

aggagcagta caacagcacg taccgtgtgg tcagcgtcct caccgtcctg caccaggact 300 

ggctgaatgg caaggagtac aagtgcaagg tctccaacaa agccctccca acccccatcg 360 

agaaaaccat ctccaaagcc aaagggcagc cccgagaacc acaggtgtac accctgcccc 42 0 

catcccggga tgagctgacc aagaaccagg tcagcctgac ctgcctggtc aaaggcttct 480 

atccaagcga catcgccgtg gagtgggaga gcaatgggca gccggagaac aactacaaga 540 

ccacgcctcc cgtgctggac tccgacggct ccttcttcct ctacagcaag ctcaccgtgg 600 

acaagagcag gtggcagcag gggaacgtct tctcatgctc cgtgatgcat gaggctctgc 660 

acaaccacta cacgcagaag agcctctccc tgtctccggg taaatgagtg cgacggccgc 72 0 

gactctagag gat 733 



<210> 2 
<211> 5 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> Site 
<222> (3) 

<223> Xaa equals any of the twenty naturally ocurring L-amino acids 
<400> 2 

Trp Ser Xaa Trp Ser 
1 5 

<210> 3 
<211> 86 
<212> DNA 

<213> Homo sapiens 
<400> 3 

gcgcctcgag atttccccga aatctagatt tccccgaaat gatttccccg aaatgatttc 60 
cccgaaatat ctgccatctc aattag 86 



<210> 4 
<211> 27 
<212> DNA 

<213> Homo sapiens 
<400> 4 

gcggcaagct ttttgcaaag cctaggc 27 



<210> 5 
<211> 271 
<212> DNA 

<213> Homo sapiens 
<400> 5 

ctcgagattt ccccgaaatc tagatttccc cgaaatgatt tccccgaaat gatttccccg 60 
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aaatatctgc 
gcccctaact 
ttatgcagag 
ttttggaggc 



catctcaatt agtcagcaac catagtcccg cccctaactc cgcccatccc 
ccgcccagtt ccgcccattc tccgccccat ggctgactaa ttttttttat 
gccgaggccg cctcggcctc tgagctattc cagaagtagt gaggaggctt 
ctaggctttt gcaaaaagct t 



120 
180 
240 
271 



<210> 6 

<211> 32 

<212> DNA 

<213> Homo sapiens 

<400> 6 

gcgctcgagg gatgacagcg atagaacccc gg 3 2 

<210> 7 

<211> 31 

<212> DNA 

<213> Homo sapiens 

<400> 7 

gcgaagcttc gcgactcccc ggatccgcct c 31 



<210> 8 

<211> 12 

<212> DNA 

<213> Homo sapiens 



<210> 9 

<211> 73 

<212> DNA 

<213> Homo sapiens 

<400> 9 

gcggcctcga ggggactttc ccggggactt tccggggact ttccgggact ttccatcctg 60 
ccatctcaat tag 73 



<210> 10 

<211> 256 

<212> DNA 

<213> Homo sapiens 

<400> 10 

ctcgagggga ctttcccggg gactttccgg ggactttccg ggactttcca tctgccatct 60 

caattagtca gcaaccatag tcccgcccct aactccgccc atcccgcccc taactccgcc 120 

cagttccgcc cattctccgc cccatggctg actaattttt tttatttatg cagaggccga 180 

ggccgcctcg gcctctgagc tattccagaa gtagtgagga ggcttttttg gaggcctagg 240 

cttttgcaaa aagctt 256 



<400> 8 
ggggactttc 



cc 



12 
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<210> 11 

<211> 2343 

<212> DNA 

<213> Homo sapiens 



<400> 11 

acgcgtccgg tttttcaaag gtttaactgt ccagggcaga tacttaagac tatctgatca 60 

tccattaaaa acttttcaca tagtcttgct taaatggatc cattatgttt acccattata 120 

ttgttctcag ctgtagtttt aagaaattta tttcatttgt taatacttac tttccattac 180 

cttccccttt tctgtgacaa tccgttgata cttgaagacc tctcttgtat tcatcttagg 240 

gttaatattt ttaaggccaa acagcctaaa ttctatggta atcaactcca gccttgtgta 300 

atgaaatctt ctgcataaag ataggtttaa atcaaatcag attgcagatt ttattgaaga 3 60 

aattgtgttt ttaagagttg acaaatatat gttgtatggc taaaacaaag aaaatacttc 420 

tgttgcttct gcatttagta gaagaaaaac tatatatgtt tgtgaccaaa gtataaaata 480 

tgattctttc cagggaggta aaggttatgc acaagatttt cactagcagc tctaaaaggc 540 

taccctcaat taattgccat gaacatttca tagccctaga aggatgtagg ctcatttcag 600 

tgtcatcctg gtttattctt tattgtatta ttcagcagtc attttaacac tatgctagac 660 

actttagaga ttcagaagag taacagggtt tctgttctca tgaagcttat caggagacag 720 

aaaacatatg aattagatct aattggaggc aaactgaaat atatagtgga gttagtgtgg 780 

ttatcagcac ataaatgagt gatccatcaa caaaaggaga aattgggagg gttttatggg 840 

ccaaaaacag catgattaaa tgtgatagag tatatgtcat gttttaggtg tgatgaacat 900 

tcagttatgt gtgacgaata ggataattga aaaaatatga aaggctatga tgccagaaag 960 

tattatggga caagatctta aaaccagtgt tacctaggga gtatgaattt aatatgggaa 102 0 

ttcttaaact cctttatgac tggaagatga gcatcagagt gtctgcgacc attttgatga 1080 

tatgatgtac cagtttttaa atgtttggct ttttccaggt gatgaaagcg ggggatgagt 1140 

taagaaccac tgctgtgaag gattcacaac tatttttagg cagttgggta aaaatgacca 1200 

atttagtttt aagaaactga ctgtggctcc agagtatgtt ggagaagtga aaatggagac 1260 

taggaataac aggtgggaga ctattagtct aattaagatg taattataaa tctaagctag 1320 

gaacgtaaaa tgagaatgca aagtaagaaa caaatatggg gaaaattata tgtaaaagta 1380 

ataggacttg gcatcttact gatgtgattg attatgagaa aaatgaagca tgtggaggag 1440 

tccactggac agtaggaaat tcagcctaag acttgggtaa gagttctgtg gagttgtgaa 1500 

ttcagaggcc agagatgtga tatttaaaat tttggttcaa gatttcccag gtataagaaa 1560 

gcaagaggat taaagcattg taattaaact ttaagcagtg catatttatg ttatagataa 1620 

gataaacaag aaatctaggg atcaaatagg attaaaatta gtagtgatca ttcagtacag 1680 

tagttacgta ctgttattca caagagtata taaatcaaat tacaaggaat taaggatata 174 0 

aacgtgataa gaaagtatgc actgtactct ttgaggaagt ttgccataga aaggaagaag 1800 

aaataggatg gtagatcaga agtaaagcag gacccagtgg ggggagtgtt tgcagtgagg 1860 

cagtatgtat aatcatttaa aacatgggtt tggagtcctc tcaggttcca tgtttgtaat 192 0 

ggacataatg ataataatcc ctttcattta aggctgttgt gaggattaaa tgtgttaatg 1980 

tgcaaataac tttacacagt gcctggtata taataaatgc ttgctaccta ttaactagta 2040 

tttgtttcta aggctaattt aagtcctaga attgattgca aggattagat caggagtata 2100 

gtggacatgt tgggatttaa atatttaaat atagagatgc tttttaggac cattgttaga 2160 

accagaagag attttttacc aagttcacac agaaatgtag gtgcattggc tgggcatggt 2220 

ggctcacacc tgcagtccca gcacttggga aggctgaggc agaagaactg cttgaggcca 2280 

acattttgag accagcctgg gcaacatatt aagaccccgt ctccaccaaa aaaaaaaaaa 2340 

aaa 2343 



<210> 12 

<211> 1177 

<212> DNA 

<213> Homo sapiens 

<220> 
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<221> SITE 
<222> (1095) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (1115) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (1142) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (1162) 

<223> n equals a,t,g, or c 



<400> 12 

agccaccatg cccggcctag attaaaaatt tgaagacata ttctctacta tgagccaatg 60 

aaattactca ttttgtttct atcccatttg ctgtcccttg cttttggaat tttgtgtctt 120 

agtgtgactg tgattctttc tctccttttg tctttcagca aacggggatt cagcgtccga 180 

tcctttggaa cagggactca cgtgaagctt ccaggaccag ctcccgacaa gcccaatgtt 240 

tatgatttca aaaccacata tgaccagatg tacaatgatc ttcttaggaa agacaaagaa 3 00 

ctctatacac agaatgggat tttacatatg ctggacagaa ataagagaat caagccccgg 3 60 

ccagaaagat tccagaactg caaagacctg tttgatctga tcctcacttg cgaagagaga 42 0 

gtgtatgacc aggtggtgga agatctgaat tccagagaac aggagacctg ccagccygtg 480 

cacgtggtca atgtggacat ccaggacaac cacgaggagg ccaccctggg ggcgtttctc 540 

atctgtgagc tctgccagtg tatccagcac acggaagaca tggagaacga gatcgacgag 600 

ctgctgcagg agttcgagga gaagagtggc cgcacctttc tgcacaccgt ctgcttctac 660 

tgagcccagc gcccgcatgg agccgcctct ggagcttcct gttgttcata ctttttcctt 720 

cctgacattt gtttttactt acaggtgttc tgctggtgac ggtagcatta cccaaataaa 780 

ctgtgcatat gaaatgggag aggagatgcc aaaacgccag atgaaagcaa tcaagtttct 840 

tcttttccac ttttacttat gagcrggata ttgattacaa agtttttctt ctttaaccaa 900 

aaaggaaaga caacggtttg tgtgcacttc ccgacatacc tgtgtcttcg tgtgcctgcc 960 

ttccctccct cctccccacc gggccggact gtacagagcc ctgctgcggc gtgttaggaa 1020 

tgacctggaa ttgtcaataa acagatgctg ctgtcaaaaa aaaaaaaaaa aaaaaaaaaa 1080 

aaaaaaaaaa raaancaaaa aaaaaaaaaa aaggnggggc cgaaggtttt ttccctttgg 1140 

tnggggttat ttttggcttg gnattggcct tcgtttt 1177 



<210> 13 

<211> 2107 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (149) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (487) 
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<223> n equals a,t,g, or c 



<400> 13 

tttaggtatg catataaaag aaaacaaaat atttaaaaca cttaaaggag atctgacgaa 60 

acctaaagag aagaaaaata ataaaattaa gtaaagaaaw ggtatggcag gattttatgt 12 0 

ttgcctgtgc cctttatcca actgatcang gcttcctgga wtcagtgaca gcagaagttg 180 

cctaccagat caagagactg aaatctcatc cttctatcat catatggagt ggcaataatg 240 

aaaatgagga ggcgctgatg atgaattggt atcatatcag tttcactgac cggccaatct 300 

acatcaagga ctatgtgaca ctctatgtga aaaacatcag agagctcgta ctggcaggag 3 60 

acaagagtcg tccttttatt acgtccagtc ctacaaatgg ggctgaaact gttgcagaag 42 0 

cctgggtctc tcaaaaccct aatagcaatt attttggtga tgtacatttt tatgactata 480 

tcagtgnatt gctggaactg gaaagttttc ccaaaagctc gatttgcatc tgaatatgga 540 

tatcagtcct ggccgtcctt cagtacatta gaaaaggtct cgtctacaga ggactggtct 600 

ttcaatagca agttttcact tcatcgacaa catcacgaag gtggtaacaa acaaatgctt 660 

tatcaggctg gacttcattt caaactcccc caaagcacag atccattacg cacatttaaa 720 

gataccatct accttactca ggtgatgcag gcccagtgtg tcaaaacaga aactgaattc 780 

taccgccgta gtcgcagcga gatagtggat cagcaagggc acacgatggg ggcactttat 840 

tggcagttga atgacatctg gcaagctcct tcctgggctt ctcttgagta cggaggaaag 900 

tggaaaatgc ttcattactt tgctcagaat ttctttgctc cactgttgcc agtaggcttt 960 

gagaatgaaa acaygttcta tatctatggt gtgtcagatc ttcactcgga ttattcgatg 1020 

acactcagtg tgagagtcca tacatggagc tccctggagc ccgtgtgctc tcgtgtgact 1080 

gaacgttttg tgatgaaagg aggagaggct gtctgccttt atgaggagcc agtgtctgaa 1140 

ttgctgagga gatgtgggaa ttgcacacgg gaaagctgtg tggtttcctt ttacctttca 1200 

gctgaccatg aactcctgag cccgaccaac taccacttcy tgtcctcacc gaaggaggcc 12 60 

gtggggctct gcaaggcgca gatcactgcc atcatctctc agcaaggtga catatttgtt 1320 

tttgacctgg agacctcagc tgtcgctccc tttgtttggt tggatgtagg aagcatccca 1380 

gggagattta gtgacaatgg tttcctcatg actgagaaga cacgaactat attattttac 1440 

ccttgggagc ccaccagcaa gaatgagttg gagcaatctt ttcatgtgac ctccttaaca 1500 

gatatttact gaaggaatct aggttgtatt ttcagtggac aatgggaata aagcatttct 1560 

aaagcaccga ctggagagga aggcaacaga gacaaggaga gaagccgaga gacatgtctg 1620 

cgtgctgcca cgcatctgag cgattgctct gtgaagagtt gtacactgaa cattttcagg 1680 

ggaggctgtt tacccaggca atgtcctcaa acaagcctgt gccggggtgt cctggaatct 1740 

gtgccaggac tgtgttttta gcccttcacc tctcagcttt agcaggacat gaaccagtta 1800 

taacaagatg sccctgcagc tggttacaag aatgtgacat ggcaggatct atggaaccaa 1860 

atggaaggtt ttgaggtgat gtaggtcttt cacagttagc tttggggaat acagaatact 1920 

caaataaagt gctttgttat tatttcagag ggaatggcga ttgaaatgtt acaacagaga 1980 

tttcttggtg gtagctattt gggtaaaggt atatggatat ttttctgtac atgtgaaatt 2040 

atataaaaat aaaagttata taaattacat tgaaaaaaaa aaaaaaaaaa aaaaaaaggg 2100 

cggccgc 2107 



<210> 14 

<211> 1262 

<212> DNA 

<213> Homo sapiens 



<400> 14 

cctaatggcc cgasctgaat acttgaagga gctcaagatg agggaatctc gctgggaagc 60 

tgacaccctg gacaaagagg gactgtcgga atctgttcgt agctcttgca cccttcagtg 120 

accctagaag aatgattgga cagatgtgag ccatctggag cagaggggca ctaacccagg 180 

ctgacgccaa gaatgaagtg gcccactgca gccctggcga gcaggcttct tggatggaca 240 

gtgctgagac ccccatatcc cagagtcccc agcctccctc aggttactct gcaccccaca 300 

gatggtttga tggctgtgct gtatactgga ggggagggca ggactctggg agaacagcac 360 

ttctttcatg agacctttgt tactcggtgg ttactgggtc ctgtgcctgt ccgttttggg 420 

gcatgcagcc ctctatcatt tttggctccg agaagagggc aaggggcccc cgcaggtarc 480 
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ttctgtgctt 
accccttatt 
ggaccaccct 
tcacacctgc 
tttggggttt 
ttccttggtc 
gggccttgtg 
ccagtgccct 
gctctttccc 
agcccacagc 
gagagtttct 
gcttccttgg 
tcattgtttc 
eg 



gccctcgccc tgecagcagg cagctgtgcc cctggcctgc ccttcccggg 
ccaactcagc tcctctttgc actggaatgg ggcactccaa cacccctcag 
ccccacagta tgcactcagc cccacagaac ccaccagtct ttctgggaac 
ccgccatctt ggtactttag gttaatccct caagcatgaa agctggatct 
aagaagecca agccttgttc ctgccctggc ctagggagca ctcaggaggg 
ctcatctctc ccacctccgt tccctctggg ccccacacta gccacagcgc 
ctggagtttg agectgggae agggagaggg aggcttggag acagtctgac 
ctaggccacc cacttctagg cctgccctgc cgccgtggag ccctgggcaa 
ctttctgggc ctgggtctcc ccatctcttc aatggggctg ataccttcac 
atgggcactt atgaggacaa agtgaattta acctggaaaa gaatgtattt 
tttaaataat cagcgggtgt tggtgatttg tagcccttct geccttaaat 
gcaagagctg tctgtcctcc ctgeaggagg ctgagtgtga agagtatcat 
tctattaaat tattttctgc taaaaaaaaa aaaaaaaaat ttctgcggtc 



540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1262 



<210> 15 
<211> 759 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (16) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (22) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (36) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (51) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (52) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (57) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (58) 

<223> n equals a,t,g, or c 
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<400> 15 

ggattaacaa attttncaca cnaggaaaac aggttnttga cccaattagg nnttttnnca 60 

aaaaagctta tttttaggtt gacacttatt agaagttacg ccttgcaggt taccggttcc 12 0 

ggaattcccg ggtcgaaccc caaggggttc gcggacccca gacatgagga ggctcctcct 180 

ggtcaccagc ctggtggttg tgctgctgtg ggaggcaggt gcagtcccag cacccaaggt 240 

ccctatcaag atgcaagtca aacactggcc ctcagagcag gacccagaga aggcctgggg 3 00 

cgcccgtgtg gtggagcctc cggagaagga cgaccagctg gtggtgctgt tccctgtcca 360 

gaagccgaaa ctcttgacca ccgaggagaa gccacgaggt cagggcaggg gccccatcct 420 

tccaggcacc aaggcctgga tggagaccga ggacaccctg ggccgtgtcc tgagtcccga 480 

gcccgaccat gacagcctgt accaccctcc gcctgaggag gaccagggcg aggagaggcc 540 

ccggttgtgg gtgatgccaa atcaccaggt gctcctggga ccggaggaag accaagacca 600 

catctaccac ccccagtagg gctccagggg ccatcactgc ccccgccctg tcccaaggcc 660 

caggctgttg ggactgggac cctccctacc ctgccccagc tagacaaata aaccccagca 720 

ggccgggaaa aaaaaaaaaa aaaaaaaaag ggcggccgc 759 



<210> 16 

<211> 1810 

<212> DNA 

<213> Homo sapiens 



<400> 16 

cacgagggtg tgcgtgctta ggcaggaacc cagttttact ttatgccatg tggaaagttt 60 

ctttttccag tatcaccagt gagttcactg tctctccact ggtctgcagt gctgcttctg 120 

ttacttgcag acttcccacg tgtgcatgga tctccacctg gggtctctag ggtctctatt 180 

ctacactgcc tatttccctt tctgtcctaa caccatagca tttaactcac ccgtcatcct 240 

gtgttgctga gaatttcctt catagaactc atcaaagtat gattaactgt gctccctgag 3 00 

ggcaggaatt atgccatctg gatcaccagc ctctcccttg tccttagcac gccatctgca 3 60 

aattagcaga tactcggtaa atgtgtatta actcgaagta tattttgtgt cttctctgtg 420 

cacagcactg ccctgggaag aactaggatg aggtattgac ttgctgttgc cacataacaa 480 

accctgccag aactccctgg atggaagtga ccaccgtgta tctgtggatt gtctgcaggg 540 

ctctgctggg gtcagcaggt cccacaacag agccagggct cggtctcctc atggctgtca 600 

gaggtttacg tattccgcct cctcccacca aagtctgaag ttgttgtatt ccattccttg 660 

ctatatccac atcttttaat aatgctaaaa tcccgtgttt ctctaaagca ttggattgaa 720 

ccaactgaag aaggaccacg tgtgttgctg ggcctgcttg ggcacaagcc gtttccgatc 780 

caagtcaact gctggtctgc ttagacgaag gtgtgtgggt gtctccacca cggagaggag 840 

ggacagcagg tgagaccata ggccaggaag gaagggcaca gcctaagcgt gcagtggctt 900 

agccagagac cctcgtgcac cagccttcca ggtgcttatt ggaacttatg tcagcccagg 960 

ccatatccaa gtgtgtgatg tctcggagca tatatgccag gccagccgga gaggcttagc 1020 

cctgccctgg tggagctgga gggccgcagg gccgcccggt ggggtcagga ggttgtgaag 1080 

aggatcctga tacaggctgg gcctccctgc aggcgtgagc cccggagcac ggggtgagca 1140 

gctccaccca gaggggcttg caggaccaag ctgggacagc aaccaccagg ccctggggca 1200 

gatcagtgag cgtccaggag atgcagatgc agaagacagc caaattcatt cacctctgcg 1260 

tgggcctgtg agggcccaca gagatgcatt ttcattcacg accaggattt cctcggccgg 1320 

agcagccgct tttcccagcc gaagctcact gtgtttacta cataggatgt gagtgtatag 1380 

aaagactctc tctaacgtta gtacgcgtgc agaaatgtgg ggccgcttac aagtgtgggc 1440 

agccgcagcc tgttcctcac ccctgtccta acgggacata ctccacgcat gcacatttag 1500 

gatcaccgtg tcttctcgtt ggactgatct gtcattagga ccctggaccc aagtaattgt 1560 

ctttgctctg aagttttgac agtaacaaag gcattccagc tctttctttt tcactcctgt 1620 

cggtgtaacg tgccgttttt catcctttga cttttagccc gcctgtgccc tgtctgaagg 1680 

gagttgtctg tggacagtca cggagtggtg ggtgtttgta atccactctg ccagcctcag 1740 

tcttctaact gttgcgtatg gaccaattac atctgccctt tctcttccct gctaaaaaaa 1800 

aaaaaaaaaa 1810 



BNSDOCID: <WO 9947540A1_I_> 
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<210> 17 
<211> 1052 
<212> DNA 

<213> Homo sapiens 



<400> 17 

gcaattttct 

aagataccat 

atttttctct 

gacaggtggg 

aaaacactga 

tatgatgtat 

gttgaactcg 

ccaaaactga 

tttggagaca 

actggttctt 

cttacatttg 

ccttgcacac 

tggaaaggta 

cctgtgatat 

gtcattgatt 

gaaagagtct 

acatcataaa 

aaaacatatg 



gcatagcatc 
atggacaatg 
ctggactgtg 
cttggatttt 
agttgcccaa 
tttttgtttt 
cagctggacc 
tctatttctc 
ttattgtacc 
cttacatata 
ttgttctggt 
ttattactgc 
acagctatca 
ctggtgaaca 
ttctacaaat 
tcaatgatat 
taactaaaac 
gtctgtgtar 



agcaatgagt 
cacgattgca 
catagcagta 
acaggatatc 
cttcaagtca 
cataacacca 
ttttggaaat 
agtaatgagt 
aggcctgttg 
ctatgtttcg 
gctgatgaaa 
ctcagttgtt 
gatgatggac 
gattgtccag 
agacttcgac 
gcttgcaaaa 
gctttgcttt 
tttcaaaaaa 



ctgtacaact 
tgtcgtggca 
gctgttgttt 
ttggggattg 
tgtgtgatac 
ttcatcacaa 
aatgaaaagt 
gtgtgcctca 
attgcatact 
tctacagttg 
aaggggcaac 
gcctggagac 
catttggatt 
caataatatt 
tttttaaatt 
atatattttt 
taatgttaaa 
aa 



gtcttgctgc 
aaaacatgga 
gggctgtgtt 
ctttctgtct 
ttctaggcct 
agaatggtga 
tgccagtagt 
tgcctgtttfc 
gtagaagatt 
cctatgctat 
ctgctctcct 
gtaaggaaat 
gtgcaacaaa 
atgtggaact 
gacttttgaa 
atgagctggt 
gttgtgcctt 



actaattcat 
agtgagactt 
tcgaaatgaa 
gaatttaatt 
tctcctcctc 
gagtatcatg 
catcagagta 
aatattgggt 
tgatgttcag 
tggcatgata 
ctatttagta 
gaaaaagttc 
tgaagaaaac 
gctataatgt 
ttgacaatct 
actgacagtt 
cacattaaat 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1052 



<210> 18 
<211> 1130 
<212> DNA 

<213> Homo sapiens 



<400> 18 

ggcacgaggc 

atgtctgaac 

cataagaata 

gataaaaatg 

ttatcaaagg 

gtacttagtt 

ttagttttcc 

ttaattctgc 

tctcttgcta 

cactgagaat 

tatttctctt 

cttctttctt 

gttggaaaac 

aagatataat 

tctatttgca 

ttacttattt 

atcagtggaa 

aagcctatat 

cgtgggcaat 



catttgtata 
tttcaggttg 
aaatggttca 
gagattttcc 
caattaaata 
acaaattgaa 
ttttccctga 
tggtgacagt 
atttgcttga 
aatacaactt 
taccttgttc 
ttatggctca 
aagatctgaa 
ggctttggat 
tttgtgtgtt 
tgtttccatg 
aagtaggttt 
ttgggaggcc 
gtagcgagac 



attctttagt 
tcttataatt 
caccaataca 
tgtgctacag 
gtgttgaatg 
ccctcttcta 
ttatcattta 
gccaaagctt 
ctagataact 
gcaagataat 
atttattacg 
gctcactatg 
tactatagaa 
tttggggtga 
attacttcta 
tctttttcca 
cgttatatag 
gaggcaggag 
ctggtctcta 



aaattgtatt 
gtctttttcc 
agtacttagt 
gcttagtcaa 
ttctgctttt 
tttttttcct 
ggcatgtaag 
tactatactc 
aagaattcag 
taatttggat 
acattttgaa 
ctttttttta 
aataataact 
tttttctact 
gttaagagta 
aaagaactta 
aaattaactt 
gattgcttga 
caaaaaaaaa 



aatgggagaa 
ttatgtcaga 
tgtggaaagg 
gcttatggtc 
acctacattt 
gctcctgttt 
tgacacccag 
tttttgttgt 
gtaagcatta 
tgttctacat 
ttatttacat 
atactggtag 
atttttctgt 
gtcagtttaa 
tttccaagga 
ttttttatat 
taggctgggt 
actcaggagt 
aaaaaaaaaa 



tctgtaagtt 
tgttctatgt 
gagagtagaa 
tatttaatgg 
catttttcat 
ctgtttcatt 
tagcattgct 
ctgttgcttt 
gctctttgtt 
gtatttcgtt 
acccatattt 
cttcctcaag 
ggtcatatta 
aaaaaacttg 
aagtttcatg 
tataataaat 
gcagtggctc 
tcgaaactag 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1130 



<210> 19 
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<211> 883 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> SITE 
<222> (19) 

<223> n equals a,t,g, or c 



<400> 19 

gtcaccgtgg gcgtttaant atgatccccg gctcagattc gcagactgca ctgaacttcg 60 

gctctacgtt gatgaagaag aagtctgatc ctgagggtcc cgcgctgctc ttccctgaga 120 

gtgaactttc catccggata ggtagagctg ggcttctttc agacaagagt gagaatggtg 180 

aggcatatca gagaaagaag gcggcagcca ctggcctccc agagggtcct gctgtccctg 240 

tgccttctcg agggaatctg gcacagcccg gcggcagcag ctggaggagg atcgcactgc 3 00 

tcatcttggc catcactata cacaacgttc cagagggtct cgctgttgga gttggatttg 360 

gggctataga aaagacggca tctgctacct ttgagagtgc caggaatttg gccattggaa 420 

tcgggatcca gaatttcccc gagggcctgg ctgtcagcct tcccttgcga ggggcaggct 4 80 

tctccacctg gagagctttc tggtatgggc agctgagcgg catggtggag cccctggccg 540 

gggtcttcgg tgcctttgcc gtggtgctgg ctgagcccat cctgccctac gctctggcct 600 

ttgctgccgg tgccatggtc tacgtggtca tggacgacat catccccgaa gcccagatca 660 

gtggtaatgg gaaactggca tcctgggcct ccatcctggg atttgtagtg atgatgtcac 72 0 

tggacgttgg cctgggctag ggctgagacg cttcggaccc cgggaaaggc catacgaaga 780 

aacagcagtg gttggcttct atgggacaac aagcttcttt cttcacatta aaactttttt 840 

ccktcctctc ttcttcaaaa aaaaaaaaaa aaaaaaactc gag 883 



<210> 20 

<211> 989 

<212> DNA 

<213> Homo sapiens 



<400> 20 

ctggcttggc tgctatactc ttgcccttca ctgaacctca gttttcctca tctgaatagt 60 

tgggagactc attcctgcct ttctcatgtc cctggctatt tggtaaacca gccagtagga 120 

agacatcgtg aaatgtatta aagtggtctt agctagacag agtgggcatg ccagggtcag 180 

cagagattct gaagtctaga ccagttccct gggtgggccg ttgtcagtcc tagcagatgg 240 

ccaggtcagc cctcaggctg gaaattttag ggcagctatt ggtaggtgtc tcctcttgct 300 

gtgctgagat acggtcaaga tcatacttag gcttttgttg gaagaacata caagacgaga 360 

gaaaaaaaaa gatcatactt aggggctccc ggaatttgct ctgccctagg ttgctgagac 42 0 

ctctagaacc tgtgcaggct aaaggaactc agtcggtaga tccgagagag gtggtcaggg 480 

agaccaggag catgtctaca ctgccagcag acttttgcct cctcccccaa gccagcagga 540 

tggcccaaaa aggctccccc agcagatcat ctttgcagct ccttttt tag ctccagtggc 600 

agcagggatg aggaagggaa agttctatca tttttttcta atttaaaatg acatttaaaa 660 

tcactagcct agtgggggcc aggtgtggtg gctcatgcct gttgtcccag cactttggga 720 

ggccaagacg agtggatcgc ttgagctcag gagttaaaga ccagtctggg caacatagcg 780 

aaacgccgtc tctataaaaa aatacaaaaa ttagctggat gtggtggtgc acacctgtat 840 

tcttagctgc ttgtggggct aaggcgaaag gatcacttga gcccaggagg tcaaggctgt 900 

agtgagctgt ttgtgccact gcactctagc ctgggtgaca aagcaaaacc ctgtctcaaa 960 

aaaaaaaaaa aaaaaaaaag ggcggccgc 9 89 



<210> 21 
<211> 495 
<212> DNA 



BNSDOCID: <WO_9947540A1_L> 



WO 99/47540 



PCT/US99/05804 



11 



<213> Homo sapiens 



<400> 21 

ggtggaatgt 

.ttccgacatt 

agaactacag 

cctgatgact 

cgactcaggc 

cagatgcccc 

gtggtgctca 

cccaggagtt 

aaaaaaaaac 



agtgaaaacg 
tgctgactgg 
gaaaggagca 
gtgagcctct 
tcatgtccca 
tgcaggccat 
cacctgtgat 
ggagaccaac 
tcgag 



agatgctgtc 
tcgatgctag 
gtatctctga 
tcctcctgct 
attcaaggca 
ctctccaggc 
cccaccactt 
gtaggcaata 



tctwagggac 
atgatcttga 
attatcgtgt 
tgccacctct 
gcaagaaggc 
tcaggaacct 
tgggaggctg 
cagcaagact 



taaggaagct 
gaccctgtgc 
ggaaggtcac 
cagtcacaag 
catggagcgg 
aaagaagaat 
aggcaagagg 
cccatctcta 



atctttcctc 
tgaggactgg 
ttgtctagcc 
acggctgctg 
cacctgccag 
ctacccagat 
atcacttgag 
caaaaaaaaa 



60 
120 
180 
240 
300 
360 
420 
480 
495 



<210> 22 
<211> 2317 
<212> DNA 

<213> Homo sapiens 



<400> 22 

ccctaaagag 

taaaatggac 

aagtgaatta 

ggtaaggaga 

taattcaaag 

tggaaaaagt 

acgtatatgt 

tgtgtcttgt 

tgggccgcag 

tcagcttttc 

gatagaattt 

tgtaaggtaa 

aactctttga 

tgtcgctcag 

gttcaaggga 

aagcctagct 

ctggtctaca 

ttacaggtgt 

atgttattta 

ttagaggttt 

cagtggcgcg 

ctcagcctcc 

catttttagt 

cgtgattcgc 

cggccggtga 

tggggaaaac 

aaatgaagat 

actctgtcga 

cggacatttt 

atcaaggcag 

aatgaggagg 

gaaacccaaa 

gagcgtgtct 

cctgtttaaa 

ttgggaggcc 



ttgaagaact 

tggatttttg 

caaccttaaa 

actgtaatgt 

cattgattta 

atgaccccag 

gcatgttacc 

atttgcatat 

tttggattta 

ctctcacaga 

gagggggaaa 

cgggaccagc 

atcaagtcat 

gctggagtgc 

ttctcctgcc 

attttttttg 

actcctgact 

gagccaccgt 

tttattcatg 

tttttttttt 

atctcggctc 

tgagtggctg 

ataggtgggg 

ccgcttcggc 

tagagttttg 

cacaaagagg 

gggacatgtt 

ttgttttaca 

acaggtgaaa 

cccgggacca 

agatccagtg 

agtaagagtg 

accctcgtag 

aagagggctg 

gaggcgggcg 



aattggtcgg 

gtagttttgg 

tttgagattt 

ttaggattct 

attccacgta 

tgtcggagat 

tttacgtaca 

attagagtat 

tgtggtatgt 

ttaatctgcc 

ataacacacc 

ccaattatat 

aattttattt 

agtggcccaa 

tcagcctcct 

tattttttag 

tcaggtgatt 

gcccgacctg 

tcttcaatag 

ttttgagacc 

actgcaagct 

ggactacagg 

tttcaccgtg 

ttcctaaagt 

aacaagacaa 

gtgacaagat 

aagtaataat 

tataccttgt 

cagggacaca 

gagtccactc 

tggcagaggc 

gcctgagtgg 

gccattggag 

ctgtccgggc 

gatcgcgagg 



taaaaattgg 

ttgcttttaa 

cctttggtga 

gaataagtac 

gtctgttata 

ggctatgtgt 

tgtggaaaaa 

gattttccta 

tgatgaagac 

agtttctccc 

agctaatgat 

aactgattga 

aataattttt 

tcttggctca 

gagtggctgg 

tagagacggg 

caccggcctc 

aatcaagata 

gtatttacat 

gagtcttgct 

ccgcctcccg 

cgcccgccac 

ttagccagga 

gctgggatta 

agctccttgc 

aatttcagat 

aatagatgac 

gacaacccta 

ggaaaagtaa 

tctgagaaat 

cctggggagg 

cccagagagg 

gcttacatgt 

gcggtggctc 

tcgggagatc 



atattgaatt 

aaaaattagt 

accatggaag 

tgtgttttaa 

ttcagaaaca 

gcatgtatat 

cagttctaat 

atggtcgagg 

ttagtgaata 

actgtgtatt 

gaaacgaact 

ggcattgcca 

ttgaggcaga 

ctgcaacctc 

gattacaggc 

gtttcaccat 

ggcctcccac 

ttatttaaaa 

gtctgtcttc 

ctgttgccca 

aattcacgcc 

cacgcccggc 

tggtgtcgat 

caggcgtgag 

aaagctaacc 

agggctaagg 

atttgaacac 

agaggtaggc 

gtaacatgcc 

ggcatttggg 

aatgggctgg 

ttgatgggag 

ggaagggaca 

acgcctgtag 

gagaccatcc 



cataagatgt 

gctagctttc 

tttacccagt 

tcacagctct 

taaaaacaag 

acaaatagac 

taagtcaata 

gcctttttgg 

gccacagtac 

gtgtatatgt 

ggctctagtc 

tttttcactt 

gtttcgctct 

cgcctcccag 

acccgccacc 

gttggccggg 

acagctggga 

gaactgtttg 

ggagatggtg 

ggctggaggg 

attctcctgc 

taatattttg 

ctcctgacct 

ccaccaagcc 

ttgggaggtt 

gataggaaga 

tgtgccagtc 

accgttatta 

cagctgttga 

caggatcgag 

aagtattcga 

gcaggagtca 

ggttctgatt 

tcccagcact 

tggctaacac 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
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cccgtctcta ctaaaaatac aaaaacaaaa ttagcccggc atggtggcgg gcgcctgtgg 
tctcagctgc tcgggaggct gaggcgaaag aatggcgtga acctggaagg cggagcttgc 
agtgagccgg gattgtgcca ctgcactcca gcttgggcga cagagcgaga ctccatctcc 
aaaaaggaat tcgatatcaa gcttatcgat accgtcg 



2160 
2220 
2280 
2317 



<210> 23 

<211> 1726 

<212> DNA 

<2 1 3 > Homo sapiens 

<400> 23 

ctttttggct ctcattttga atttttcaag agctcatgtt ctttgtcttc attaaaaaaa 60 

aaaagttctt atatcgtgta tgaatgtcat tcgggatatc tatacataca tgcacatacc 12 0 

tcatttttat tgcatttcac tttattgcac tttgcaaagt gacatttttt acagattcaa 180 

ggtttggcaa ccctatgtcc ataagtctgt cagcaccatt tttctaacat atgtgctcat 240 

tttgcctctc tgtcacattt tttttttttc ctgagacagg gtcttgctct gtcacccagg 300 

ctggaatgca gtggtgtaat tatgcctcac tgcggccttg acctcttggg ctcaagggat 360 

cctcttgcct gcgcttcttg agtagctgag actacagatg tacaccacca cacacccagc 420 

taagttttaa atttttttat agagatgggg tttccctatg ttgcccaagc tgctctcgaa 480 

ctcctgggct taagtgatcc tcccacctca gcctttcaaa gtgctgggat tacaggcatg 540 

agccacagca cctggtctct gtgtcacgtt ataataattc tgccaatatt ccagatttcg 600 

tcattattaa atctgttatg gtgatctgtg atcagcgaac tctgatgtta ctgtctaatt 660 

gctttgggat gcagtgaacc cgtcagagtc atccatgagg gttggaatca acttcttcca 72 0 

aaatcctgtt aatcaagagt gaacttaatc gattaatgtt gtgtatgttc tgactgctcc 780 

accaatctgt gggtccccta tcactctccc tctcctcagg cctccctatt ccctgagaca 840 

caataatatt gaaattagac caattaataa ccctgcaatg tgaaaggaag aagttacatg 900 

tctctcactt taaatcaaaa gctagaaatt attaagctta gtgaggaggg catttcgaaa 960 

gctgagagag gctgaaagct aggccggttg tgccaaatag ctagccaagt tgtgaatgca 102 0 

taggaaaagt tcttgaagga aattaaaatt gttactccag tgaacacaca aatgttaagt 1080 

aagcaaaaca gccttattgc ttatggaaag aaagtctgaa tggtctgaat agaagatcac 1140 

atcagccaaa acatgtcctt aagccaaagc ctaatctata atagatcagg ccctaagtct 12 0 0 

cttccattcc ttgaaggcac agagaggtga agaagttgca gaagaaaagt tggaagctag 1260 

ccaaccttgg tttgtgcagt ttaaggaaaa aagccatctc cataacatgg aagtgcaaga 132 0 

tgaagcagca ggcactagtg gggaagctgc agcaagttat acagaaaatc tagctaatga 1380 

tgagggtggc tacactaaaa acagattttc aatggagaca aaacaccctt ctattggaag 1440 

aagatgccct ctaaagcttt cataggtaga gagaggtcag tgcatgggct tgaaagaaca 1500 

ggctgattct cttgctagag gccaatgaag ccagtgagtt taagttgaag ccagtgctaa 1560 

tttatcattc tgaaaattgt agggccctta agcattatgc taaatctact ctgcctgtgc 1620 

tctagaaatg gaatagcaaa gcacgaatga cagcacatct gtttacaaca tgatgtgctg 1680 

aatattttaa gcctatattt gagacctact gctcaaaaaa aaaaaa 172 6 



<210> 24 

<211> 529 

<212> DNA 

<213> Homo sapiens 

<400> 24 

acgcgtccga ttacttacgt gctcctggct gggatggcac tgggcattca gaaaaggttc 60 

tccccggagg tgctgggcct gtgtgcaagc acagcgctgg tgtgggtggt gatggaggtg 120 

ctggccctgc tcctgggcct ctacctggcc accgtgcgca gtgacctgag cacctttcac 180 

ctgctggcct acagtggcta caaatacgtg ggaatgatcc tcagtgtgct cacggggctg 240 

ctgttcggca gcgatggcta ctacgtggcg ctggcctgga cctcatcggc gctcatgtac 300 

ttcattgtgc gctctttgcg gacagcagcc ctgggccccg acagcatggg gggccccgtc 3 60 
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ccccggcagc gtctccagct ctacctgact ctgggagctg cagccttcca gcccctcatc 420 
atatactggc tgactttcca cctggtccgg tgaccccctg gccccagatg gcactgagtt 480 
tttcattcat tgaagatttg atttccttga aaaaaaaaaa aaaaaaaaa 529 



<210> 25 

<211> 1755 

<212> DNA 

<213> Homo sapiens 



<400> 25 

ggcacgagcc tcacagcgcc tctgctggag ttcctgctgg ccttgtactt cctctttgct 60 

gatgccatgc agctgaatga caagtggcag ggcttgtgct ggcccatgat ggacttcctg 120 

cgctgtgtca ccgcggccct catctacttt gctatctcca tcacggccat cgccaagtac 180 

tcggatgggg cttccaaagc cgctgggggg tctgtgcctg acactcgggc tgtttgtcca 240 

agcagatctg aaatgggccg tgagctgggg gcagcagcct cccgggagca gggagtcagc 300 

cctgtgatgc atcccatcca ccctgtccac aggtgtttgg cttctttgct accatcgtgt 360 

ttgcaactgr tttctacctg atctttaacg acgtggccaa attcctcaaa caaggggact 420 

ctgcagatga gaccacagcc cacaagacag aagaagagaa ttccgactcg gactctgact 480 

gaaggcctgc gggtgccttg gcaacctgag ccacacaggc ctccacccct gcgcctcaca 540 

ggggtcgctg gcgttggagc ggaggcctgg acttctgagt tgcagagggg gctgcggaca 600 

cagcaggccc cctacagcct caggttctgc ctgagcccag cctaccaggc ttgcccctca 660 

gctcagcact gttgaccacg ctgcgtatga gggcatcttg ggtatcccac tccttctccc 720 

catttctgtc ccacaggcct tcagcccttt aacgtctctg ccaaaaacca gcacaaggag 780 

acaaagcaga gccttgtctg tatctgggca gcaggtgttc catgctgcta ggtggcgggg 840 

gtcgggggtc ttctgtttca ctaacaggaa caaagacaga aaccatgaca gggctgcccc 900 

gccaggcccc ggtgggtttg tctgcacttg gtgctcctgc ccacaccagc cactttggtg 960 

acaatgaccc ttccaagaat ctttggttca aggagcacca gttccctctt cattcttgaa 1020 

gcagggagaa attgaccttt gccttgtcgc ccaggaagtg gggctcggca cccataacta 1080 

acacctccca cccttggaaa ccatgtcttc tgggggtgag atgaccattc tgggtctaag 1140 

actgtttcaa agaagagctc atagactgac tggtccagaa gacagagggt acaacagtgg 1200 

catcacagtg acagtgtcat ggggagctgg gcgggcccag ccaaaccctc cttcttccta 1260 

gagcccagcc agcaggcagg agttcctgga ccctcaggac agtgaacttc cagacctcag 1320 

ggcaggtcta tgggccactg caggagatga gaccagcctt ctgtgttcac ctaacgattt 1380 

atactgtgta tctgtctttg atggaatttt gtaacttttt atattttttt atgcaaaagc 1440 

agcttcttaa cagatggcat tttctgtgac tctaggcctc acaaaagagc cagagttctg 1500 

gacccatgtt tggagcattt gtagccttat tctcttgcgt gtgaatctct taccctgaaa 1560 

aaaagccata atgaattaag ccagactgac cacttgcttg gagtgtgtgc ttgaaaaaac 1620 

cagagcaata ctgttgggta ttgtatcagg cttcagtaca aactggtaac accaatgtgg 1680 

atcctgacag ctttcagttt tagcaaaaat acacgtgaaa tctgactacc atttaaaaaa 1740 

aaaaaaaaaa aaaaa 1755 



<210> 26 

<211> 1751 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (1520) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
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<222> (1557) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (1689) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (1729) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (1735) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (1741) 

<223> n equals a,t,g, or c 



<400> 26 

gggtgcagcc tgatggcgca ggaggtagac acggcacagg gcgccgagat gcggcggggc 60 

gcgggcgcgg ctcggggacg cgcttcctgg tgctgggccc tggcgctgct ttggctcgcg 12 0 

gtggttccgg gctggtcccg ggtctcgggc atcccctccc ggcgccactg gccggtgccc 180 

tacaagcgct ttgacttccg tccaaaacct gatccttatt gtcaagctaa gtatactttc 240 

tgtccaactg gctcacctat cccagttatg gagggtgatg atgacattga agtttttcga 3 00 

ttacaagccc cagtatggga atttaaatat ggagacctcc tgggacactt gaaaattatg 360 

catgatgcca ttggattcag aagtacatta actggcaaga actacacaat ggaatggtat 420 

gaacttttcc aacttggcaa ctgtacattt ccccatctcc gacctgaaat ggatgcccct 480 

ttctggtgta atcaaggcgc tgcctgcttt tttgagggaa ttgatgatgt tcactggaag 540 

gaaaatggga cattagttca agtagcaact atatcaggaa acatgttcaa ccaaatggca 600 

aagtgggtga aacaggacaa tgaaacagga atttattatg agacatggaa tgtaaaagcc 660 

agcccagaaa agggggcaga gacatggttt gattcctacg actgttccaa atttgtgtta 720 

aggaccttta acaagttggc tgaatttgga gcagagttca agaacataga amccaactat 780 

acargaatat ttctttacag tggagaacct acttatctgg gaaatgaaac atctgttttt 840 

gggccaacag gaaacaagac tcttggttta gccataaaaa gattttatta ccccttcaaa 900 

ccacatttgc caactaaaga atttctgttg agtctcttgc aaatttttga tgcagtgatt 960 

gtgcacaaac agttctattt gttttataat tttgaatatt ggtttttacc tatgaaattc 1020 

ccttttatta aaataacata tgaagaaatc cctttaccta tcagaaacaa aacactctct 1080 

ggtttataaa acaccttaat tctactgctc ttttttctcc aatcaccagc atctgttttt 1140 

cagggggtga ttttactttt gtgaattcct tagcctttct tccttggtgc ataaagttaa 1200 

aatgcacatc agcagaattg ctgcatatta acatctcagg actcttctct tgtaaagaag 1260 

ctgaaattcg tactatattg gccaaagtga gcgagttagg tgatcttggt ttcaatttcc 1320 

gagcctttgt taatatggag aattatggtt catatcagtt atgtaggacc tttggaccca 1380 

gggtcctaca gatagatatg gtgtgcccag attttaaaaa taccttcaaa aataaaaaat 1440 

acattcagtg acattttcat ggtgggagct cttctttctg atatggcagt tacacttttt 1500 

cacttaagtg ctttagtttn agactaactt tacaacttct ataacttttg ggaaccnagt 1560 

ttagtatagt ctgattacat tccattcacc taactttagg cattcggttt agacaccata 1620 

actggrgkgr atkgkgcytc cyagratgtg ggcaaatccc agtggttaac accatatttc 1680 

tgggctggng attttgggga ctagctaggt aaacgggctt ggtggttcnt ttaancatac 1740 

ntaaccacca c i 7m 
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<210> 27 
<211> 1212 
<212> DNA 

<213> Homo sapiens 



<400> 27 

gccaagcttg gcacgargtt ggtggcggcg tccggaggtg ctggtttgtt ctcggtgaac 60 

ggcgcgcggg gtctctcctg agtgcgagct acgggacctt cgccatgccg gggatggtac 12 0 

tcttcggccg gcgctgggcc atcgccagcg acgacttggt cttcccaggg ttcttcgagc 180 

tggtcgtgcg agtgctgtgg tggattggca ttctgacgtt gtatctcatg cacagaggaa 240 

agctggactg tgctggtgga gccttgctca gcagttactt gatcgtcctc atgattctcc 3 00 

tggcagttgt catatgtact gtgtcagcca tcatgtgtgt cagcatgaga ggaacgattt 3 60 

gtaaccctgg accgcggaag tctatgtcta agctgcttta catccgcctg gcgctgtttt 420 

ttccagagat ggtctgggcc tctctggggg ctgcctgggt ggcagatggt gttcagtgcg 480 

acaggacagt tgtaaacggc atcatcgcaa ccgtcgtggt cagttggatc atcatcgctg 540 

ccacagtggt ttccattatc attgtctttg accctcttgg ggggaaaatg gctccatatt 600 

cctctgccgg ccccagccac ctggatagtc atgattcaag ccagttactt aatggcctca 660 

agacagcagc tacaagcgtg tgggaaacca gaatcaagct cttgtgctgt tgcattggga 720 

aagacgacca tactcgggtt gcttyttcga gtacggcaga gcttttctca acctactttt 780 

cagacacaga tctggtgccc agcgacattg cggcgggcct cgccctgctt catcagcaac 840 

aggacaatat caggaacaac caagacctgc ccaggtggtc tgccatgccc cagggagctc 9 00 

ccaggaagct gatctggatg cagaattaga aaactgccat cattacatgc agtttgcagc 9 60 

agcggcctat gggtggsccc tctacatcta cagaaacccc ctcacggggc tgtgcaggay 102 0 

tggtggtgac tgaaattagc tggacatggt tgcacacacc tgtaatcaca gctactcggg 1080 

aggttgaggc gggagaatcg cttgaaccag ggagttggag gttgcagtga gtggagatca 114 0 

caccattgcc ctgcagccta agcaacagag caagattctg tctcaaaaaa aaaaaaaaaa 12 00 

aaaaaactcg ag 1212 



<210> 28 
<211> 1112 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (1105) 

<223> n equals a,t,g, or c 



<400> 28 

ggcacgagca aacatccagg agtgtgcacc ggtcatgcaa ggtgttttgt ttggctttgt 60 

ctggcttttt agttttttgt ggcaggagaa taaatctagt gcctctccct ccacattagc 120 

caaaagtgga agtccctgtc cagtcagcat tccttggatg cctggtgtat tagtccgttt 180 

tttcacactg ctataataaa aagaactgcc caagactggg taactaataa aggaaagagg 2 40 

tttaattgac tcacacttct gcatgtttgg gaggcctcag gaaagttaca atcaggcaga 300 

aggtgaagtg cgttcgtctt aatggcggca ggtgagacag tgtgtaggat aaactgtcaa 3 60 

acacttataa aaccatcata gctcatgaga cctcattcac tgtcacgaga acagcatggg 420 

ggaaccgccc ccatgatcta atcacctccc actaggtccc tccctccacc tgtggggatt 480 

atgaggatta caattcaaga tgagatttgg gcaggggcac cgagccaaac catatcacct 540 

tatatgtgcc cagtgttgac ctaggcgctg ggatgcagaa acaaacacga catgggctgt 600 

gccttgggga gctcacactc ttgctggaga agcatgctga ttcctaaata agaaatgcta 660 

tgtgctgtgt acagagtacc atggaaggca ggatgaactc tttgggagga agaagcaagg 720 

aaagctttag agagttgctg gcttttgagg gatggagcag gcattttcta catggggaaa 7 80 

gtgtaggaaa gagtattcca ggcagagtgg agagcaagag caaaggcgga gaagcctgtg 840 
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ctgcgaattc cttgccgggc aggatccctg tcttactgct gtttagagat caatatatgt 900 

caagtgactg gaagtgtggt ttttgttctg ggactagtag gtagaacaga aagagttggg 960 

atggagtgag caacccatgg agaaatagag gctcggggtc agctgataca aggcgttgta 1020 

taccaagctg aggagcacaa gatttggaac ataataccaa atgctgggga gccatgggag 1080 

ggccatggga gctctgatag tgttntctcg ag 1112 



<210> 29 

<211> 748 

<212> DNA 

<213> Homo sapiens 



<400> 29 

ggcacgagcg aaactgtttt ccaatgtggc tgaaccactc tgcatttcca ccagtaatga 60 

gaatgagagt tgctgttgct ccacggcctc accagcattt ggtggtgtca gtgtcttgga 120 

ttttagecat cctaataagt gttagtggct atcattgttt teatttgeaa ttctcttaca 180 

tggtgtkgaa catctttccc catgtttatt tgtcatctgc atatcttctt eggecagtta 240 

tctgttcaga tcttttgccc gtttttgttt gettgeatgt ttgtttgtgt ttgatttttt 300 

aaagaaagct ttttttatta ttgagttgta atagtgcttg tatagtgtgg ataacagttc 360 

tctatcagat aggtcttttg caaatatttt ccccaatctg tggactgtct tctcattctt 420 

ttgataaatg gctttaaaat aataatctgg ccgggcgcag tggctcatgc ctgtaattcc 480 

agcactttgg gaggecaagg gcagatcatc tgaggtcggg agttcgagac cagcctgacc 540 

aacatggaga aaccccatct ctactaaaaa tataaaatta gtcgggcgtg gaggcacatg 600 

cctgtaatcc cagctacttg agaggctgag acaggagaat ctcttgaacc cgggaggtgg 660 

aggttgcagt gagecgaaat cgtgccactg tattccagcc tggacaataa gagcaaaact 720 

ccatctcaaa aaaaaaaaaa aactcgag 748 



<210> 30 

<211> 778 

<212> DNA 

<213> Homo sapiens 



<400> 30 

ggaactaaaa agctttgtgt tcttcagggt gggtggcagg gggatatagt gagggtggac 60 

cagggagaat gaccataggg cactaagtaa ggctgggatt ggatcagcag aaatccaacc 120 

ctctaacctt agggtaggga gtgctaagga tctggggaaa ccatgggctg ggaagctget 180 

cttgctctcc tcgtgtctgc tgtttttttc ccttggtgta ctatacagag gccagatgtt 240 

ggcaccacct ctccaggagg attggaaagg aggagtaaag gattctgatt tgattgatga 3 00 

ttccagtgca tccccaatcc caccatctta cctggaatat aaggctgect tgtacccctt 3 60 

ttctgagcac aagtctgtgc gtaatgeaac tgactctctt acttttttct tagtaactga 420 

tcatttccta gacaaccaag attctcaata agtcccagtc tcatcacaaa tattaatatt 480 

tccttttcct cataccaact tgactatgtt tcactgaaac ccacaggtct tgggacagaa 540 

tgaggcatta cctcattgaa etttagctge ctgcatgagt cctctgtcct caagtctttc 600 

tcagatcatt tctcaagctg gctcccagct tagggcaaag agaatctcca tgatgtgctg 660 

acttctagct tgccacagac acaattctac tccaaagtca gectggcata gtaacattga 720 

tgtcagggga gacatatcag tttgaggeca tacaaaaaaa aaaaaaaaaa aactcgag 778 



<210> 31 

<211> 1324 

<212> DNA 

<213> Homo sapiens 

<400> 31 
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acgagctaat gattcttgct gaagatggcc agagtaatca gagtaattaa tttggggaat 60 

ggtgaaggat aaggaacctg atcagatatc aagggtgggg gtactcttgc taaactgact 120 

cagtagggtc cttgctaaaa ctggttttta caagagagag cacaggtagg tctacaagaa 180 

gattcgggag actgactaaa gtttcatcaa gaatctttgt caggagacat gacctttatg 240 

atacttaagt ttttcttttt atgcggtttt gttttaaaca ggctaatagc tcgtcagctt 300 

gctaaaatcc atgctattca tgcacacaat ggctggatcc ccaaatctaa tctttggcta 360 

aagatgggaa agtatttctc tctcattccc acaggatttg cagatgaaga cattaataaa 420 

aggtaaaatt atttttacat ttgaaattat gttttacatt gctttgctct atgatagggt 480 

ttcaaaggta attgtaaagt ttcattgtat aaaatctggt tttctttctt tgcattgaag 540 

taaagtaagc attgattctt tggctgtcag atagcactac agaaataact gcctctccat 600 

ccccctcagt gtcccctccc aaaaaatatg ccctacaaca gcaaggggca gaagtggaag 660 

taggggaaca ccacttaaaa ataaagaggt aggtggtaat ggtgaagcaa ggatttttct 720 

tgacttttta agatactgca gggttgagag gcaagtctag tattatatag aataagagca 7 80 

caagagcctg gagccaaact gagattaaat cttagtcctg ctagttaaca tttctgttgt 840 

gtaactcatt caactaggga acttaaccta actgtttcct tatctataaa atggaaatta 900 

cagtagtagt atattaacca tatagagttt ttgtgaggat tgagatagta tatgtaaagt 960 

tctttaaaac agtgcctggt catcacttaa gtgttgaggt agctgctgtt ttaaaaatta 1020 

ctattgttat tcaaagaagg ttatttgagt tatatttttt ccctaggctc tccaaggtat 1080 

actttaaatc cttgaggtta atgattcttt ggaaaagctg gaggtgtgct gtggtaaata 1140 

ataggaagaa aacccttgcc ccaatagaaa ataatacaac tagaacataa aacacaatta 1200 

aaatattaaa tacattttgt atttctttta ttccattttg tttgttcatg atttaagttt 1260 

tataatcctt cgaatcccac aattttcatt cgacactacc tttaaaaaaa aaaaaaaaaa 132 0 



<210> 32 

<211> 739 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (732) 

<223> n equals a,t,g, or c 



<400> 32 

ggcacgagga 

tagattgcag 

gcctgtcact 

catttttcag 

tctcattttt 

gtgcttactt 

ggtttacaca 

ggaggaaaaa 

tattttttaa 

tgggttgtgt 

aggcaaagtc 

aacctctgcc 

ctgaggtcgt 



caggatcctg 
ggcatttgtt 
gctaactcct 
ttgatagttt 
attttgctgg 
ttggagtttt 
gtaaacaatg 
aagagatata 
gcctagaggg 
taagcctaac 
tcgccctgtc 
tcctgaggca 
gncattgca 



gtttgggtac 
tgagactatt 
tagtattaaa 
atatactttc 
attgttttct 
gattccctgt 
tgaatgtgat 
aaggtaatca 
aactctttgt 
cctaacttct 
acccaggctg 
ggagaatcac 



cttagtttaa 
tagccacagc 
actgtcaaac 
tctgaaggat 
gttttttgct 
gtcactgttt 
caccaaaata 
ccaccaccct 
tggctctgtt 
wctctctctc 
gagtgcagtg 
ttgaacctgg 



tagaaaccca 
agggcaagca 
atgggaggta 
cctaatgata 
tcagcattct 
tctttcgcat 
cgcacagaac 
cccacctcct 
aagtttaggg 
tctctttttt 
gcacgacctt 
caggcggagg 



ggtggaaacc 
ggaagatgca 
actgctgatg 
gttaaccatt 
tgcttttgct 
acacctctca 
atctgaccga 
gttttgttgt 
ttaatgtgat 
tttttttttg 
ggctcactgc 
ttttggtgag 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
739 



<210> 33 

<211> 1462 

<212> DNA 

<213> Homo sapiens 
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<400> 33 

ggccatcggc ggggcagtcg cgggatgcgc ccgggagcca cagcctgagc tttagcccat 60 

gaggaggatg tgaccgggac tgagtcagga gccctctgga agcatggaga ctgtggtgat 120 

tgttgccata ggtgtgctgg ccaccatctt tctggcttcg tttgcagcct tggtgctggt 180 

ttgcaggcag cgctactgcc ggccgcgaga cctgctgcag cgctatgatt ctaagcccat 24 0 

tgtggacctc attggtgcca tggagaccca gtctgagccc tctgagttag aactggacga . 300 

tgtcgttatc accaaccccc acattgaggc cattctggag aatgaagact ggatcgaaga 360 

tgcctcgggt ctcatgtccc actgcattgc catcttgaag atttgtcaca ctctgacaga 420 

gaagcttgtt gccatgacaa tgggctctgg ggccaagatg aagacttcag ccagtgtcag 480 

cgacatcatt gtggtggcca agcggatcag ccccagggtg gatgatgttg tgaagtcgat 540 

gtaccctccg ttggacccca aactcctgga cgcacggacg actgccctgc tcctgtctgt 600 

cagtcacctg gtgctggtga caaggaatgc ctgccatctg acgggaggcc tggactggat 660 

tgaccagtct ctgtcggctg ctgaggagca tttggaagtc cttcgagaag cagccctagc 720 

ttctgagcca gataaaggcc tcccaggccc tgaaggcttc ctgcaggagc agtctgcaat 780 

ttagtgccta caggccagca gctagccatg aaggcccctg ccgccatccc tggatggctc 840 

agcttagcct tctacttttt cctatagagt tagttgttct ccayggctgg agagttcagc 900 

tgtgtgtgca tagtaaagca ggagatcccc gtcagtttat gcctcttttg cagttgcaaa 960 

ctgtggctgg tgagtggcag tctaatacta cagttagggg agatgccatt cactctctgc 1020 

aagaggagta ttgaaaactg gtggactgtc agctttattt agctcaccta gtgttttcaa 1080 

gaaaattgag ccaccgtcta agaaatcaag aggtttcaca ttaaaattag aatttctggc 1140 

ctctctcgat cggtcagaat gtgtggcaat tctgatctgc attttcagaa gaggacaatc 1200 

aattgaaact aagtaggggt ttcttctttt ggcaagactt gtactctctc acctggcctg 1260 

tttcatttat ttgtattatc tgcctggtcc ctgaggcgtc tgggtctctc ctctcccttg 1320 

caggtttggg tttgaagctg aggaactaca aagttgatga tttctttttt atctttatgc 1380 

ctgcaatttt acctagctac cactaggtgg atagtaaatt tatacttatg tttccctcaa 1440 

aaaaaaaaaa aaaaaactcg ag 1462 



<210> 34 

<211> 2815 

<212> DNA 

<213> Homo sapiens 



<400> 34 

gggtcctgga gtgccctcgg ctgatagaga ctatagttcg agagttcttg cccaccagtt 60 

ggtctcctgt gggggcaggg cctaccccta gtctatacaa agtaccctgt gctactgcca 120 

tgaaactact tcgtgtcctg gcctcagctg ggaggaatat tgctgcccgg ctgttgagca 180 

gctttgatct ccggagccgc ctgtgccgca tcatagctga ggctccccaa gaactggcct 240 

tgcccccaga ggaagctgag atgctgagca ccgaggccct ccgtctgtgg gctgtggctg 300 

cctcctatgg ccagggcggt tacctttaca gggagctcta cccagtgctg atgcgggcct 360 

tgcaggtggt gccgcgggag ctcagcaccc acccacctca acccctgtcc atgcagcgga 420 

tagcctcact gctcactctc ctcacccagc taaccctggc agccggcagt acccctgctg 480 

aaaccatcag tgattctgct gaggccagcc tctcggccac cccttcctta gtcacttgga 540 

cacaggtgtc tgggctccag cctcttgttg agccgtgtct aaggcagacc ttgaagttgc 600 

tgtccagacc tgagatgtgg agagccgtgg gcccagtgcc cgttgcctgc ctgttgttcc 660 

tgggagccta ctaccaggcc tggagccagc aaccaagctc atgcccggag gattggctcc 720 

aggacatgga gcgcctgtca gagagctgct gctgccactg ctgagtcagc ccacactggg 780 

cagcctgtgg gattccctta ggcactgctc ccttctctgc aacccgctgt cctgtgtgcc 840 

agcccttgaa gctcccccca gcctcgtgtc actgggctgc tcgggaggct gcccccgtct 900 

cagtctggct ggctcagcct cacccttccc attcctcact gccctcctct ctcttcttaa 960 

taccctggcc cagatccaca aggggctgtg tggccagctg gctgccatat tggctgcccc 1020 

gggactccag aattacttcc tccagtgtgt ggctcctggg gctgccccac acctcacacc 1080 

tttctctgca tgggccctgc gccatgagta ccacctgcag tacctggcac tcgctctggc 1140 

ccagaaagcg gcagcgctgc agccactgcc agccacccat gctgccctct atcatggtat 1200 
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ggccttggcc ctgctgagcc ggctgctgcc cggaagtgag tacctcaccc atgagctgct 1260 

gctgagctgt gtattccggc tggagttcct cccggaaaga acatcagggg gtccagaggc 1320 

agccgacttc tctgaccagc tgtcgttagg aagcagcaga gtccctcggt gtgggcaagg 13 80 

gactctgctg gctcaggcct gccaggacct ccccagcatc cgcaactgct acctgactca 1440 

ttgctcgcca gcccgagcca gtctgctggc ctcccaggct ctgcaccgag gggagctaca 1500 

gcgagtccca accctgctac tgcccatgcc tacggagccg ctgctgccca ccgactggcc 1560 

cttcctgcac tgattcgcct ctacaccggg cttcagacac cccctcggga ctctctccac 1620 

agacaccatg ggcacagcca tgcgggtcct gcagtgggtg ctagttttgg agagctggcg 1680 

cccccaggct ctctgggctg tgccccctgc tgcccgcctg gcacggctca tgtgtgtgtt 1740 

cctggtggac agtgagctgt tccgggagtc cccagtacag catctggtgg cagccctcct 1800 

cgcccagctc tgtcagcctc aagtcttgcc aaacctcaac ctggactgcc gactccctgg 18 60 

cctgacgtct ttccctgacc tctatgccaa cttcctggat cattttgagg ctgtctcttt 1920 

tggggaccac ctctttgggg ccctggtcct cctgcccctg cagcgtcggt tcagtgtcac 19 80 

cttgcgcctt gccctctttg gggaacacgt gggagccttg cgagctctga gcctgcctct 2 040 

gacccagttg cctgtgtccc tggagtgtta cacagtgcct cctgaagaca acctggccct 2100 

ccttcagctc tacttccgga ccctggttac tggtgcgctc cgcccacgtt ggtgccccgt 2160 

gctatatgct gtggctgtgg ctcatgtcaa tagcttcatc ttctctcagg acccacagag 2220 

ctcagatgag gtcaaagctg cccgcaggag tatgctgcag aaaacatggc tgctggcaga 22 80 

tgagggtctc cggcagcacc tcctgcacta taagcttccc aattccacgc tcccagaggg 234 0 

ctttgagctc tattctcagt tgccccctct gcgtcagcac tacctccaga gactgacttc 2400 

aacagtgctc caaaatgggg tatcagagac ctaggatagt tgatatagat ggaaagatgg 2 460 

gtacgttgtc ctgtatccag cctttcaaca gatgtctggc cagacgaaga acattgtgtc 2 52 0 

ctaatggtag gcaggagacc aaggagcaga aggcttgcct tcctgggagc aggttgtttg 2 580 

agctgtttta gagcagtgag ccctaccatt acatcctgat atctggggct tctgaaggtc 2 64 0 

tgtgctggga gtgaagagtg gcttagctat ttacccgctc tttggggaca gggcaaacta 2700 

aatgcatccc ttcttaccta actcccaacc cctgccctgg gctgaggcat atgaatgcta 2760 

tagttgtgca ttaaaataaa tgttttttat ctcctggaaa aaaaaaaaaa aaaaa 2 815 



<210> 35 
<211> 1078 
<212> DNA 

<213> Homo sapiens 



<400> 35 

ggtgggctct gtgctgggtg ccttcctcac cttcccaggc ctgcggctgg cccagaccca 60 

ccgggacgca ctgaccatgt cggaggacag acccatgctg cagttcctcc tgcacaccag 12 0 

cttcctgtct cccctgttca tcctgtggct ctggacaaag cccattgcac gggacttcct 180 

gcaccagccg ccgtttgggg agacgcgttt ctccctgctg tccgattctg ccttcgactc 240 

tgggcgcctc tggttgctgg tggtgctgtg cctgctgcgg ctggcggtga cccggcccca 300 

cctgcaggcc tacctgtgcc tggccaaggc ccgggtggag cagctgcgaa gggaggctgg 360 

ccgcatcgaa gcccgtgaaa tccagcagag ggtggtccga gtctactgct atgtgaccgt 420 

ggtgagcttg cagtacctga cgccgctcat cctcaccctc aactgcacac ttctgctcaa 480 

gacgctggga ggctattcct ggggcytggg cccagctcct ctactatccc cccgacccat 540 

cctcagccag cgctgccccc atcggctctg gggaggacga agtccagcag actgcagcgc 600 

ggattgccgg ggcyctgggt ggcctgctta ctcccctctt cctccgtggc gtcctggcct 660 

acctcrtctg gtggacggct gcctgccagc tgctcgccag ccttttcggc ctctacttcc 720 

accagcactt ggcaggctcc tagctgcctg cagaccctcc tggggccctg aggtctgttc 780 

ctggggcagc gggacactag cctgccccct ctgtttgcgc ccccgtgtcc ccagctgcaa 840 

ggtggggccg gactccccgg cgttcccttc accacagtgc ctgacccgcg gccccccttg 900 

gacgccgagt ttctgcctca gaactgtctc tcctgggccc agcagcatga gggtcccgag 960 

gccattgtct ccgaagcgta tgtgccaggt ttgagtggcr agggtgatgc tggctgctct 1020 

tctgaacaaa taaaggagca tgccgatttt taaaaaaaaa aaaaaaaaaa aaaaaaaa 107 8 
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<210> 36 

<211> 1217 

<212> DNA 

<213> Homo sapiens 



<400> 36 

cggcacgagg ttgaatgtta gccctggagg agatccatgt cttactcgct ctttctggcc 60 

cttctgtctt ttgcctctgc aattcttttt gtagctggca cgatagcagg gactgggggt 12 0 

ctatcctttc atggtattgc tacaatattt gtccttactg gaaaatggta acatccgggt 180 

ctgatttaat tggcattaca cttacacagg gactctgagc acccccgtca ccacaccaga 240 

cagtggacca gttttcacag ctacaaagag ctagaaatgt gtttaacatc atccagtgca 3 00 

tcccctaatt caaaaccatc ctcactaatc aatcatattc acccataaat attacaaatg 3 60 

agattgattc catctcaaga caatttgtca aatacttaat tttcttcctg gatgattcta 420 

cttactggat attttagaaa gagaaatgtc tgagataaaa tccctcacat ttactcaata 480 

taacaaatta ctgtttctac tcctattctg agtagtgctt ctgaagattg tttgctgtag 540 

tgttgtcttt gataaaatga atgtcagtag tgagcctttt agagatacca tgctcagaaa 600 

tcctctttgg gatcagaaga tacctaaaat tctccccttt tgcccacttg gttagatgag 660 

tgatatattc tttggatcct gcaaagaaga gattggtttc ttttcttttc tggtggtggt 720 

agtggttgta tctgtggctg tgatggttgt tgttacttgt ctctctctct ctctggctct 780 

ggcttttgct ttcctgctag tgttctttct ctttccaaac aaatagttaa attaaacgtg 840 

agcttctgaa ttgtacttgt tcatactttc aaaacataac agattaataa aaatagatgt 900 

gtcctgattt aaaacatgcc ccctggaaag gcatgctgta ttatgaaatc atgataatat 960 

aactgcatta ttacatggca gtataaatat tagtctgttg aattcatttg tccaattgta 1020 

taactttgtg gagcagtgtt ttgacctttg atacataatt ctggagcaag tggagtggtt 1080 

gcaggcagat gagacagtgt tatatcagga tttttcaatc aactttagtt ggaggcctgg 1140 

caattacaaa catcttcaga tgtttctgta accattataa atatgaaaaa aacctcttca 1200 

aaaaaaaaaa aaaaaaa 1917 



<210> 37 

<211> 1282 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (153) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (1220) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (1222) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (1232) 

<223> n equals a,t,g, or c 
<220> 
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<221> SITE 
<222> (1246) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (1282) 

<223> n equals a,t,g, or c 



<400> 37 

actcgtgccg aattcggcac gagccattct gagtttggtc ccttcccaaa agtaggggtt 60 

ttgtgtggaa aatctgagca aacctctgtt gactgttctg gggtggagtg aagggagaar 12 0 

gggctcagct aaagaacatg gggagattag ggnaacaatg ccttttattt cttgctttta 180 

aagcaatttc aggagttttc ttcctctttt ggcgtcctgc tgactccaca gagcggaaca 240 

cccaaagctg ggactttcca cctctctaat gctcagtgaa gagcgggcca ggggggtgtg 3 00 

gaaaagaaag ggtcctggag gagcccaaat tacgaatggc tagagactgg cattggcaag 3 60 

cgaggaggct tcgtcacagt gtagtcttcc ggttgtccga gggtactgtc ccaggggctg 420 

gggggtrttc cgtcttctgc agatcaactc ccgcaggcta aatgtggaca tcgcggtatc 480 

atgcttgata aacggaccaa taatcaagtg gagattcatt agaaccacat aacccatact 540 

aggttgattt ctcaagtata agscctggtc tgttgcccag sctggagtgc actgacacca 600 

tcatggttca ctgcagcctc aaactcctga gcccaagtga tycctcccac tcagcctcac 660 

aagtagctaa gactagaggt gtgcaccatc amacccagct aatttttaaa gttttttttg 720 

tararatggg gtcccactct acaaaatatw taagtataag gcckggtctg ttgcccargc 780 

tgggsaaccc ttggactaag gcaatcctcc agcctcagcc tcctaaagtg ctgggattac 840 

aggcgtgagc caccgcaccc acctctagga tctctactat tgaggaaaaa ttggaggcat 900 

caaactccaa gggcaaaaca tgaagactcg ctggcccacc atggatggag gttttctctc 960 

ttaaaattcc cacagcaccg catggaactg cctctcctgg gacctcagcg tttccttctt 1020 

tgctctaagc aatagcctct gccactggag attctgagat ggccgatttc cttttggata 1080 

tttaagtttt gaaatcatgc tcatttggca taggaatgtt tcacttcagt ctcctttaaa 1140 

caaaaggaca cacaaccacg attgcccctc cctcccgaag ggtcactgga cttcatgcat 1200 

cagtaatgtt tccaaaaatn tnttaagtac cnacatgcag tggccngctt ttcatttttc 12 60 

caagtgaagc catcagaaaa an 12 82 



<210> 38 
<211> 559 
<212> DNA 

<213> Homo sapiens 



<400> 38 

gattcggcac gagctgaagc cctgggtgcc actgctggcc cagcagggag gaggttgctg 60 

ctgctcgggc tgaagtgagg tgtgggtctg gctgggcctc cagtttccca cctgggcctt 120 

gattgtgagg aaggcctggc ctggctgcag aagcccagaa gcacctgagt aggagagttc 180 

ctttgtccca cctgcagctc attcaagcct gtgcatgggg gttggggtcc tcaggatctt 240 

gctttcctgt ttaggggagg cagccccaaa gagtgctggg accagtttgg agagtgctaa 3 00 

ggaatgctgg tctgcagcga ccctacttgt gctctgcgtc ctctgccaac tgcagcatgg 360 

gtgaacatct gtacatctgt ccccataatg aaaatggcct cagcaaataa caaaaatatt 420 

accatttagc aatcaggcac ttattaaaag cctggcccaa taaacttaaa aaaaaaaaaa 4 80 

aaaaactcga gggggggccc ggtacccaat tcgccctata gtgagtcgta ttacgcgsgs 540 

tcamtggccg tcgtttaca 559 



<210> 39 
<211> 803 
<212> DNA 
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<213> Homo sapiens 
<400> 39 

ggcagagcta ggccaggcag agcctagctc ttgccagggc agcaggaagc cacacagtgt 
gttgaagccg gagcaggaga gggggccctg actcccatgt gtccttgcag gcaggagcag 
ttcgtggact tgtacaagga gtttgagcca agcctggtca acagcaccgt ctacatcatg 
gccatggcca tccagatggc acctttcgcc atcaattaca aagtaaggcc tgggccctgc 
cmaaacattc actgtctgcc cacccagccc caccccatga agccatctgt ccctcatccc 
cacagggccc gcccttcatg gagagcctgc ccgagaacaa gcccctggtg tggagtctgg 
cagtttcact cctggccatc attggcctgc tcctcggctc ctcgcccgac ttcaacagcc 
agtttggcct cgtggacatc cctgtggagt tcaagctggt cattgcccag gtcctgctcc 
tggacttctg cctggcgctc ctggccgacc gcgtcctgca gttcttcctg gggaccccga 
agctgaaagt gccttcctga gatggcagtg ctggtaccca ctgcccaccc tggctgccgc 
tgggcgggaa ccccaacagg gccccgggag ggaaccctgc ccccaacccc ccacagcaag 
gctgtacagt ctcgcccttg gaagactgag ctgggacccc cacagccatc cgctggcttg 
gccagcagaa ccagccccaa gccagcacct ttggtaaata aagcagcatc tgagatttta 
aaaaaaaaaa aaaaaaactc gag 



<210> 40 

<211> 1510 

<212> DNA 

< 2 1 3 > Homo sapi ens 

<220> 

<221> SITE 
<222> (426) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (454) 

<223> n equals a,t,g, or c 



<400> 40 

cacgagaaac attctatctt ttatcaaatg tgtgattcat aacttttgga taccaaagga 60 

atctaacgaa ataaccataa tcatcaatcc atacagggag actgtgtgct tctctgtgga 120 

gcctgtcaag aagatattta actatatgat acatgtgaat cgaaacatca tggatttcaa 180 

actcttcctt gtgtttgtgg caggagtttt tcttttcttt tatgcaagga ccctggagtc 240 

aaagccctac tttctattac tcctcgggaa ctgtgctagg tgttctaatg acatagtctt 300 

tgtcttgctg ttggtgaaaa gattcatccg aagtatagca ccttttgggg ctctaatggt 360 

tggttgttgg tttgcctcag tttatattgt atgccagttg atggaagatc tgaagtggct 42 0 

gtggtntgaa aacaggatat atgtatcagg ctangtcttg atagttggat ttttcagctt 480 

tgttgtttgt tacaagcatg ggccccttgc acacgacagg agcagaagtc ttctgatgtg 540 

gatgctgcga ctcctctccc tggttctggt ctatgctggt gtggctgtgc ctcagtttgc 600 

ctatgcagcc ataatcctcc tcatgtcctc ctggagtctg cactacccac tgagagcatg 660 

cagttatatg aggtggaaaa tggagcagtg gtttacatca aaagagctgg tggtgaaata 720 

tcttacggaa gacgagtaca gggagcaagc tgatgctgaa acgaacagtg ctctggagga 780 

gctacgccgg gcctgccgaa aacccgactt tccctcatgg ctggtcgtct ccagactcca 840 

cactcctagc aaatttgcag attttgttct tggaggaagc cacttgtcac ctgaagaaat 900 

cagtctgcat gaagagcagt atggccttgg gggtgccttc ttggaagagc agctctttaa 960 

cccgagtact gcctgacatg cgaccttcaa gttgacttca ttctggacaa ggaagtgggc 1020 

aaagggcagg attctattaa agttaggcag aactgttcta gtgaacggtg gcaaaaacat 1080 

ttgctgtgga gaaaaacaag tcagtctgga aaggaaaacc aacccatttt gaagataact 1140 

tagcattctt ggtgacttct gctacttatt gtactgtagg tggataccaa aattctgtga 1200 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
803 
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cagccactac cacttacctt gaatgaaggc tttcattagg aacaggggaa tggcgttgtt 12 60 

cttaaggggc tagtaagcat gaacaggtgc tttgtcgaca ccagggcact aaatctggtc 1320 

ttaatcccct gaacctgtgt cagaagactc tgcaatactc ttcctatagt tcgtcagtat 13 80 

aagtccttaa agagacctga gacatgctgg accagtgttt tccaaagtac agctcacagg 1440 

ctactaccaa gtgttggtca ataaaggtat tctgaggtca actaagattg ataaaaaaaa 1500 

aaaaaaaaaa 1510 



<210> 41 
<211> 1095 
<212> DNA 

<213> Homo sapiens 



<400> 41 

gcttggtggt gctatttgct tcttcaaatt ctcgttattt aaaatatttc tttcttgtac 60 

cgttgattct gggatcagcc tggatgtgtc aaacactgcc tgccaggctt agagctcagt 120 

gcatttcttc ccttttattc ctgctgatgg gattgctggc catgaccggt gagaggaatc 180 

aaggaaccca ttactatgag ttctcaggat tcatcttcaa atctcaaatg atgtggtcaa 240 

ttaaaccaaa ttaaaaacaa gctcttgtta aaagcaagtt aaaaacaagc tcttgacctt 300 

gagaagaaat gattggtatt aggaagactg ttgagctgat actgcccttc attcattctc 360 

taccctggtg cttggataca ggagcaaagt aagaaaataa tcacagcttt attgagggct 42 0 

ctatgagcaa ggcttggtga ggatggaaga gaatggagct atcagttgat gagaacctac 480 

taggtgttga gctccttaca ttcattgcct atttaaaact ttctaacaac ttcatgtgta 540 

agcgttgtcc cgatttaaaa aaaaaaatag atgtggaaac tgaacctgga gaaggtgtgt 600 

aatttgtcca aggttgcaca ggcaaagggg caaaattcag ctttaaaccc aggactgttt 660 

ccacagctcc aagtyccctt tattcatggg atttgtaaga tggagcccct gccactgtag 720 

catttataac ttactttgga gaataagatt cctgaaagta cgtttaataa aaaaaaaaga 780 

tgtccagcta tgtacggcag ctcacgcctg taatcccagc actttgagag gcaaaggggg 840 

gaggatagct tgaggctaag agtttaagac taacctgggc aacatggcaa gaccctgtct 900 

ctaaaaaaca aaattagcca gttgtggtgg catgcacctg tagtccaagc tactcaggag 9 60 

gctaaggtga gagggtcgct tgagcccagg agtttgaggc tgcagtgagc catgatggcg 1020 

ccactgcact ccagtgcaga gtgcaggcta cagaatgaga ccccatcaca aaaaaaaaaa 1080 

aaaaaaaaac tcgag 1095 



<210> 42 
<211> 1162 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (340) 

<22 3> n equals a,t,g, or c 



<400> 42 

ggcacgagct gattcctaag gaatattcta gccaaatcat gtatctgtgg tttagttttt 60 

ctacagtagg gctgtgcggt tgctgcctgc tttatagggc atgtgggttt atatggtatc 120 

tgctgttact tgggcacagc agcaccaact cattacagga tggaggggca gaacgcccag 180 

agcacccctg ggctcacgtg cggtacagct gcaggagaga gctgtccttt tggttttatg 240 

tttttaatta attctgtttc ctcagattga tgattaaatt tatttttcca gcctgaccaa 300 

gaaggcgtca ccataccaga tctggggagt ctctcctcan ctctgataga cacagagagg 3 60 

aatctgggcc tgcttctcgg attacacgct tcctatttag caatgagcac accgctgtct 420 

cctgtcgaga ttgaatgtgc cagtaagaaa atctttactt tttgctaatt agcagatttt 480 

ttttttttgg aactgtaagt gccattaaga gtgggagagg gccaggcaca gtggttcatg 540 



BNSDOCID: <WO 9947540A1_L> 



WO 99/47540 



PCT/US99/05804 



24 



cctgtaatcc 
agaccagcct 
gcatgttggc 
gagcctggga 
gacagagcaa 
taatcccagc 
cagcctggcc 
ggtgacgggt 
ccaggacaca 
agagcgagac 
gacaaaaaaa 



cagcactttg 
gggcaacatg 
acgtatttgt 
ggttgaggct 
gactctctct 
actttgggag 
aatgtggtga 
gcctgtaatc 
gaggttgcag 
tcggtctcca 
aaaaaaaaaa 



ggaggttgtg 
gcaaaacccc 
agtcccagat 
gcagtgagtc 
ttaaaaaagc 
gctgaggcgg 
aaccccatgt 
ctaggtactc 
tgagctgaga 
aaaaaaaaaa 
aa 



gcacgtggat 
atctctacaa 
actcaggagg 
atgatcatac 
aggagatggc 
gtggatcacc 
ctactaaaaa 
gggacgctga 
tcacgccact 
aggagaggag 



tgcttgagat 
aaaacacaaa 
ctgaggtagg 
cactgcactc 
caggcagtgg 
tgaggtcagg 
tgcaaaaatt 
cgtaggagaa 
gcactccagc 
gattcaacac 



caagattttg 
aattagccag 
aggattgctt 
cagcctgggt 
ctcatgcctg 
agttcaagac 
agctgggtgt 
ttgcttgaac 
ctgggtgaca 
agttgatgat 



600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1162 



<210> 43 
<211> 657 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (12) 

<223> n equals a,t,g, 



<400> 43 

cccccccggg 

aactaatcac 

tgggtcaaaa 

ttgggctacc 

caccttcttc 

tgcagtatca 

agaatatggc 

ccttatcctg 

atacaatctt 

caaacatccc 

attaaaaaaa 



gntgcaggaa 
aagcagtttc 
tggagcctga 
caccactccc 
ctctcctgat 
ctggatagga 
atccctccac 
accaaagact 
acttgcaggt 
ctggttggtg 
aaaaactttt 



ttcggcacar 
taaaccaaaa 
gtcctgggcc 
aaggcattct 
caacatcggt 
ctggtggaaa 
ctatatttga 
gtgttggggt 
ggatattctc 
atcacttaca 
tgttaatata 



attttacatg 
aatgacatgt 
ctgtgcctgc 
tccaaatgtg 
atgatgtctc 
gggagcagcc 
tgtggacggt 
gccatttgaa 
tatactctct 
gttgtgtcca 
aaaaaaaaaa 



cttttaagtt 
tgtaaaagga 
ttcttttcct 
aaatcctgga 
ctgttgcctc 
tgacagagct 
aaggctaggc 
aatcgcaggg 
tttaatgcat 
cctttatttt 
aaaaaaaaaa 



aatgttggaa 
caataaacgt 
gggaacagcc 
agtaagattg 
accctttgtc 
ccaaatgtgg 
ctgcaggatc 
ttgcaaaaga 
ctaaaaatcc 
atgtactttg 
aaaaaaa 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
657 



<210> 44 

<211> 1155 

<212> DNA 

<213> Homo sapiens 



<400> 44 

ggcacgagtg 

acatgctctt 

tcaccatctc 

gccatctgtt 

ctctagtttc 

ccagtgggct 

aaaatgaagg 

ggggagaatg 

ggaaaatccc 

actgacaaat 

catcagtttt 

agcacaatgc 



gaagtgtaag 
cctctctgct 
tgctcctcat 
agagctccaa 
aaattcctgg 
gcaaaatttc 
gaataattgt 
cccaggggac 
aattttattt 
aatccatggg 
caacaccttg 
atctccctct 



cagaaataca 
tctatctgca 
cccgcatggt 
ccacgtggaa 
ggaaaaatga 
caagcgtagc 
gagctgttca 
agatgcattt 
tcctacagag 
ggcagcttag 
atacatcagg 
aattgtgtca 



gcgagggctc 
catctgcttt 
ggggaaggat 
tgacggaatc 
cccagctcac 
ttctgtcagt 
gattcaccaa 
gggtaaggga 
tcagcatccc 
cagatgggtt 
cttggccctt 
tgtgctggag 



aggaaatact 
atttctttgc 
gcccacccac 
cattctgttc 
ttcaggctcc 
tccttgcttt 
gaaattatct 
caataacaag 
acacattttc 
gaaaaaagcg 
gctacctcat 
gagaatgtga 



agaataggca 
ctcagcagac 
acctccccag 
tctatctctg 
cactcttggt 
gggttaggtg 
actattgttg 
acactagaaa 
cttcacagaa 
acaggctcat 
gcattattta 
agttctgtct 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
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aaaaaaaaaa aaaaa 



gtctttagca aacatgtttc aagtactgtc tgtctgaaaa ccaaatggaa gagggtaaac 
ttgatgatcc acttgatttt agttttagga cctggatgca taggcagatg tcagtttaca 
aggattctgt gtactttaag gaatgttttc tgagcatgtc cagtacaaca gacgctctgt 
taggtagctg tagttaggat tttttggttg taagtatgtg aagatttaaa tgtatcagct 
cacttactca gaaaatctga ggcagtgcta gccaaaccaa atggttcaag caaatgtcat 
cagtatttgg cctcttccag tctttttact cctctatcct ctgtgtctgc ttcacttcta 
cacaagcttt ctctatgtgg tggctccaga ttttatatct tctagtagat atttttttaa 



780 
840 
900 
960 
1020 
1080 
1140 
1155 



<210> 45 
<211> 1112 
<212> DNA 

<213> Homo sapiens 
<400> 45 

gccggaggaa gagcgtctgc aaaactgggt tcctagaagt atagacggac ttagcttttw 60 

gtagaatttg gtgaggagca gcgcctcgtg agagcagaat ggcctggcgt ggccagtgct 120 

tcccggcagc acgcagctct gcggcctcca gaattcccct gttctgagct tgatgcccct 180 

agcctgtccc ctacctactt cctcccctcc tctctagccc tctcacaggg gtgattgcta 240 

cctctctgtt ttcttgggcc taggcaagtt ttagaggagt tcccaagcat tgttatgagg 300 

ccagtgtgct cgctgggctg ggcgggatgg cctgggcttg tgtgtggcct gagggctctc 360 

ctggggcctt ctcttttccc agtcaccttt ggagccacag aagcagtgca ctcattggat 420 

gtctgttctt aacacagctt ctctttctac attaaaaaaa atcattattg cattttggaa 480 

agcagtgctc atcaaaagca acttttaaaa cctattttat tgttccttta aatgttctct 540 

cccgctgaaa ctgccctgga gaggctatct gctgctcttc catttaccca catcaggtta 600 

ttctccatgt cactcagtgg agatgactcc agatgtgttt aaagmctgga caattcacct 660 

atactgtgta ggaaattacc tccttaatta cctggtmgaa ttgtcagcag acatgttcat 720 

ccgatgatag tactgcagtt ttctattaat aatttgcaga cttttatcta acctgcactc 780 

atgtacagat tattaaaagt tttaaaatgt aactgatcag tattgatcaa tcattgtctt 840 

gatttttttt tacagcgtat atttctaatc atatttttta aagccaagag aactggttga 900 

atgaatgttt attttcctga aggtattttt aagataaagc ttcctaatgg cgtgtaaact 960 

ttgcatatgt atgtagtttg atacatattg tcacatttga aaatcttgtg ggttgtaact 1020 

ggttttatac aaaatatcga atagtggaaa ttgtataatt acaatcatgt aattaaaagt 1080 

attaacccaa aaaaaaaaaa aaaaaaytcg ag 1112 



<210> 46 
<211> 4023 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (1049) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (2758) 

<223> n equals a,t,g, or c 
<400> 46 

cccacgcgtc cgtccaaaca tcaggaggca ggcagcatgg taaatgagaa agaagccagg 60 
actgggagtc caaagtcctg gcttctatgt ctggctttgc tactaatcaa atatgtgact 120 
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ttttgcaaac catacctcac taaaccttac tttcttcatt tgagcgtgtt ggaccagctg 180 

tccccaggaa cccccttgga ttgatctgag aaggcaagga taagtttttc aaaggaagaa 240 

aagaggagta gtcagtccgc agtacagtag acacaagccc caggacatct gagtgtcttt 3 00 

cagcaagaac tctctgtgat atttcactac aatttctctg gcaccttggg actctcctca 360 

gcccttgtgg tggtgggtct tgtttaacta gcagttccct ccattctatg cctgtgaaga 420 

atctatcacc taccatgtga ttacagtgca gatttttttt tccttttcct tttctttttc 480 

tttctttttt tttttttttt tgtttgagac ggagtctcgc tttgtcaccc aggctgcagc 540 

gcagtggcgc gatctcggct cactgcaagc tccgcctccc gggttcaccg ccattctcct 600 

gcctcagcct cccgagtagc tgggactata ggcgcccgcc accgtgcctg gctaattttt 660 

tctattttta gtagagacag ggtttcaccg tgttagccag gatggtctcc atctcctgac 720 

gtggtgatcc gcccgcctcg ggctcccaaa gtgctgggat tacgggcgtg agccactgcg 780 

cccggcctac agtgcggata ttttatgaga gaggagatca caactcagtc cccaagccct 840 

caacccttaa tacatactat cgtatgaaat gcctctttcc aaattcagcc ttttctaaaa 900 

ctcaagatga gaaaactgct gatgaggctc actttctaaa ataccggaat ttgcaatata 960 

gggagaatag tttttcatgt ttctttgttt aagcaataga aagaaaggaa acttatgtcg 1020 

tttacttttc aggccataga ggttttcana acaacttgaa aacatgatca aattagccaa 1080 

acttctgata gttttcaatg tagtctgtga tcatgggata atttagcctc agttcttttt 1140 

ctgaaattgt gttttgaatg tttgatttga cttatttacc atcaaacttg ctataaggtt 1200 

attactctaa tgaataagca tattccctta attgggacaa tttactatta tttctttcat 1260 

aaagtagggc accattcacc atctatttcc tggctcttta gttatcaaaa tgttaagctc 1320 

attgctattc atcccggcac agcacttata tgagaggcat gaagctggct gaattctgca 13 80 

tcattaggaa tgacacagcc tcatcacatt gacaccagtg cttgtctctc acaccaatcc 1440 

aaattaagac caactgaaaa tagtcagagt ttcctctgga gctccttttt gaagagacat 1500 

atgtttttta gtctggtggt acccaaaatt gaacaaaaaa tgggtgctgc ttctcttaat 15 60 

aggcaaaact atgctgcagg ataatgtatt catgcagggt cttccagcca gaccccaaat 162 0 

catccctccc ttcactagaa tttttctgtt taattcgatg gccactctcc acagggatcc 1680 

attctgtgtc ttattacagg agatgctcaa tgaatgaggg acttatcttc tagaaatgca 1740 

gctccgaggt agtctgttga gtgaaataat gaatccatta tcacaaaata aattgaaagc 1800 

tgtctgacat ttggacaatt tttattttgt ttcacattgt tctgaaaact atactgtttc 1860 

ttttctccct attatttaaa taagcaaatg atgaacagat tacaaaattg aggacactcg 1920 

aggtagggaa ggagcccctc gacaggagga tcaggacata gtaccaaggg caagagaaac 1980 

gattcaataa acactattta ctatatattt taggcatggt tctaggtaat cacatgataa 2040 

gtagttgaaa gaactgaaaa tgttttatct gcaagaaaag ggcaagtgta atatcttcaa 2100 

attttagaaa gaatgtaaat tagaatttga cttaatttgg tgtagttctt gtgggcagaa 2160 

attgaattga ataggctgaa agttataaga aggattttag ctcagtattg atactggatt 2220 

gctcatgggt ggtgagagtt actcatcact ggaagagttc aagcaggggc cataagaaat 22 80 

ctcagggatt ttataaggtg attcatgctc tgggaaaagg atgccttgga ttattgtgtc 2340 

agggtacttc taactctagg attctggttt ctaagatctg gactctagtc ttgccactca 2400 

cctgccatca agaacatgtt cctcatctgc aggacaggac caagatggct ctgtctacct 2460 

taccgggttg ctgtgaggcg tgattgtgat aaaatacata aaggcagttt ttaagctctg 2520 

aagcactagt taaatgtgta gcgtatttta agattctgtt gtatgtacaa ttgtttagca 2580 

gtctctctct ctttctttct ctttctttta tcagagatag atgattttcc ctcttatttc 2640 

caccagtttg gcttttcagg gaaggtggca gctggcagaa tcccctgaca acaaaaggta 2700 

cagcaaaaaa gtggaggcct aaagaaaaca tgtgctagct ctttagcccc tgaatagnta 2760 

agtcacatgt cagcctgctc tccttcatct gtttgggagg aggcagatta gagtcacact 2 820 

gtcatcatgc tctttcccct cagaagcagc tgtaaggttt ttggtagctg tcagtgctag 2 880 

caaacagtgc ttttctcaca gaactactgg aaagagtcct ggctcggaaa acttgctctt 2940 

gaaagtggca cggccagagc aggggtctct agagggtcgt gccacctcta cctgccacag 3 000 

gttccattgt cggtcaggta agttagaggc agcagttccc cacctgccct ctggataaca 3 060 

gcagcctggg gctgctcctg agtcatgttt ccacttctgt cttacaggcc tcattttcct 3120 

acccatcttt ctgtaaaaat gaaagtcagg agtcttatga aacttaccat tattcaatac 3180 

aggcttttgg tttttttctt taaattagat agggttaggt aagaagtaga gttctataga 3240 

acgttcatag gaagcaacaa aagttgatct cttggtctct acaataggag aggattgggc 3300 

tagatacctt caaagctgac ttgccctaat attctagtat gaaatgattc gaaggtacac 3360 

ctgcccctat catgtcaggc agtgagtaca gttaaaacat tgggaattgg taaaggaaag 342 0 
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aaaaaaactg 
tcctccytct 
tatccagggg 
actatcttct 
atgctctttt 
ccaacactgt 
ttccttcagg 
gaatcttggg 
aacccacctc 
catgccgtgg 
aaa 



aaaagaaccc 
ttttcttycc 
acaggcccct 
ctgtggccar 
cccacccact 
cgctacagta 
agcaggcatt 
caagttacat 
acagggttgt 
caccaagtaa 



tttgaagtta 
acagcttcta 
ttggctccaa 
cgcagctctc 
ggaaggctca 
aggacctgaa 
ggtagtgcag 
agcctctgtg 
tgtgaggatc 
gcactcaata 



gacaaactgt 
gaattcctct 
cccacacgcc 
ttctgtgttc 
caggcaaggt 
gtgactttga 
aggcacagat 
agcctcatcg 
caatgagttg 
aatcactcaa 



ccagagacat 
ccagagctac 
tgaactttaa 
acagaatggc 
gagagaggac 
gaaattcacc 
tccgtccttt 
gtaaacagtg 
atttaggtaa 
ctcctttaaa 



agtgctaaaa 
tctcaagtta 
ggatcattgg 
catgataggc 
acagaaggtg 
ctcacaaacc 
accagctgca 
ggggttatga 
gcacctagca 
aaaaaaaaaa 



3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4023 



<210> 47 
<211> 542 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (389) 

<223> n equals a,t,g, or c 



<400> 47 

agggcacgag 

tggatctttt 

ctcagcctta 

ctgtagggtg 

acctccagct 

acattgcaga 

agactgttag 

ctactcaaat 

ctccacagct 

at 



tttttttatg 
cagctatgca 
gcattgttga 
tttagcagca 
cagttgtgac 
gaaattgagt 
attctttgtg 
gtatttggga 
cttctccctt 



actacataat 
catggttagt 
cattttggtt 
tccctgatct 
aactactgaa 
ctggttgaga 
ttcgttgtnt 
tcagccactg 
caaacactgt 



gtttattgcg 
cataatgatt 
aattctttgt 
ctacctacta 
tgtctccaga 
agtcactgtt 
ctggcctgta 
tcttccattt 
tcctcagcat 



atctatttta 
gtcattttag 
tgtaggggct 
aatgccagga 
caactactga 
ttagggcata 
taactcttct 
ctcttttgct 
cttgtttttt 



aggcttttca 
gtcagagttt 
gtcctgtaca 
gcaacacagt 
atgtctccag 
atttttgggt 
taattatctg 
cacagatcta 
gcagccaaac 



60 
120 
180 
240 
300 
360 
420 
480 
540 
542 



<210> 48 
<211> 1495 
<212> DNA 

<213> Homo sapiens 



<400> 48 

cggcacgagg 

tgttatttgt 

gaaaagtagt 

tcaatgttct 

agctgggatt 

gtcctatcaa 

tttccctgct 

agttcccaat 

ctaatgaagc 

aaagagacta 

gaaatagtta 

tgcactgtgc 

atcacgtcac 



ctacttatat 
tgattgggtt 
gtaagcagta 
tacacattgt 
atgcagcttt 
cttctggaat 
atgcaaaacc 
tctttcttat 
ccattccttt 
atcactcaat 
aatatggatt 
tgatgcaaga 
agtatttctg 



tttatgaagg 
tgttttttgc 
ggaagaaaat 
attactgcat 
gcaagaatct 
tatctaatta 
tttcccagac 
tacaggctca 
ttgtacatga 
atgaaaacat 
attttgtcct 
attctacatt 
ctctatttat 



acattttttg 
ttgttggttt 
gaggaagatg 
tgtggtaata 
actagatttt 
ttgcttttaa 
cttggtttct 
ggtgtacagg 
agatgtcact 
gaaaacattt 
tttacttttt 
ttaatgaatt 
tcatatacat 



ttagatgatc 
gtttgtttct 
tattttgcat 
gcttctataa 
attctaactc 
aagtttcctg 
taaaagaaag 
ttattctggc 
taaacctatg 
ttgcttaaaa 
aaaaaaagt t 
ataaaattat 
agaaatatat 



tcatcctctg 
tccatgtaag 
gttcttcctt 
aatctgccat 
atattagctt 
cctttcaacg 
atgttgctac 
ttaattttat 
tttacaaact 
tattaagatg 
acatattgta 
tctgcatctc 
a tgggcttaa 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
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tcatttaaaa tttgttgcag caagaactct cctacctgta ggcaatagat tgctatgttt 
tcaacaaatt gtggcaaatt ctaaacagca attcttttgt acgtaatagg acatttcata 
ctagaaaaat aaagtaatgt ttttgacatt ggatttggtg cagtttctaa tgaagcaatg 
gttggttggt taatatgtct tctgtagctg ttagcattgc caaattaaaa agggtaaatt 
ttatggaaat cctgagacca ggaagatatc aatttcatgt gtacttaatg gtataaagtg 
ttttacagtt tctatcacca tacaaataca taaagacatt ttatagtttt atcaactata 
gagctttagt ctttcaaaag taatttttga aaaacataca ttcctggcca ggtgtggtgg 
gccacgcctg taaccccagc actttgggag gccgagggag gggggatcac ctgaggtcag 
gaatttgaga ccaacctggc caacatggtg aaaccccatc tctactaaaa gtacaaaaat 
tagccaggca tggtggcagg cacctgaaat cccagctact agggaggttg aggcaggaga 
atcacttgaa cctgggaggt ggaggttgca gtgagccgtg atcacgccat tgcactccag 
cctgggggac aagagtgaga cttcatctca aaaaaaaaaa aaaaaaaaaa aaaaa 



840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1495 



<210> 49 
<211> 818 
<212> DNA 

<213> Homo sapiens 
<400> 49 

aaaacttgag tatgttgagg gaaggaatat atatatatct gggagagaat ggatacgttt 60 

tgtttttctg aaatggaatt agaaagatgt tcagttgtct tgtgcattct tgcaaacctt 120 

gcagttttga gagccctgtt tctgccttgt atcattttcc actgtgtatc kgattctagg 180 

agcgtgaaca gggagacaaa ggtgaagttt gtgcacacct ctgtccatgg ggtgggtcat 240 

agctttgtgc agtcmgcttt caaggctttt gmccttgttc cycctgaggc tgttcctgaa 300 

cagaaagatc cggatcctga gtttccaaca gtgaaatacc cgaatcccga agaggggaaa 3 60 

ggtgtcttgg taacctaatt tttttttaaa ttatgaaatc tgcttttata ttcaaaacta 420 

ttactgtcaa gtaaaataca tttttatgtg ttttcattgt gctgaagaaa aactaatttc 480 

agcatggaaa tatgtatgtt tggctgggtg cagcgtctca tgtctgtaat cccagcactt 540 

tgggagacca aggcaggcag atcacttgag gtcaggtgtt cgagaacagc ctggccaaca 600 

tggcaaaacc ctgtctctac taaaaataca aaaattagct gggtgtggtg gtacatgcct 660 

gtaatcccag ccacttggga ggctgaggca ctagaattgt ttgaacctga gagatggagg 72 0 

ttgcagtgag ctgagattgc accactgcac tccagcctgg gtgacagggt gacagagcga 780 

gactctgtct caaaaaaaaa aaaaaaaaaa aactcgag 818 



<210> 50 

<211> 1711 

<212> DNA 

<213> Homo sapiens 



<400> 50 



ggcacgagcg ctcctgtcct gccactgagg gacccggtta ccaaccctca tgtagctcag 
tttgcccatc tgtcccggtg ctaacacaca gttctcggga gactttcccc attcccagag 
gagtagtgcg aaatgcgtgt acctctagtc ttaagctggg cgtttgtatt agttgggttt 
tctggtgtct atttagcaag tgaaagtttc tggttccctc cttcactgtg tgacctgact 
agtcctcctg gattgcattt atggaagttt atacgagacc tagtttccat ggaggaactc 
actgattccg cgagggagat ggggtactgg atgatggtct tcagccttaa ggctatgttt 
ccagtgtcct ctgggtgttt ccaagagcgg caagaaacga ataaatctct gacccttctc 
aggtgcagcc agagagacac tagcccactg atggacggac agacgtgggc aagggtccgt 
gtcactaaac cacccaccac tgccacagct gcctacaaca gacacatcag atgacactcc 
gggcaaataa atgattttca ctgaggactt actggtttta ataataggtc ctggtgtaga 
gaagtccctc aacctattgt gcaacgagtt ttgagaagcg ggtaagctgt atgttttgtg 
gttttgtttc ataaattcat ctacaggaag accaatattg actgaatgaa gctttcattt 
aaagagctaa aatatgcttt gtgtttttat atgtggatac tactttaaac ctaacgacta 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
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ttcattgtat catagcttgt gatgtattct gctcatggct tttaaggtaa attgtgccat 840 

gatccactgc cattctaatt gctttaacaa gtcattacca cactactgtt acatcttaat 900 

tatgcataca gacaggtaga cttgttttac atatgtgaac taactagttg tcaaagcaaa 960 

tgcagattgt attctgcaag taaagtcttt ttctctctga aatttctagg gatgttcttt 1020 

aagtgaaatt catattaaaa ctgaagattt tagttacaag aactgagtgc agattaagtc 1080 

tttgtgattc aacatagtca agatacaact gtggatattt catggaagta tgcaataaaa 1140 

tgtctctacc tggaaaaatc tatcaagcag cgtcacagta ctgaatttga aaccagaaat 1200 

actgggtttt tatataaatg cttcatagat ttgttttatg ataaagggca cataactctc 1260 

ctaaacctca caccacctct tgaataggta taataagtcc acatcaatgc tgatgcctta 1320 

gctattatta aactcttaca gtatgatgta aagtgaaagt acaatgtaag atcattccta 13 80 

ggccaacttt gaccagtttt atacagaaac atgtgccaac ttttctgttt gcaaggataa 1440 

tatcaaagca aacaccagaa agttatatct ttgatgcatt ttttcaaaat catacacata 1500 

atacacaaac caaagacaaa tgatgaatat tacgtcagaa aatataaagt cttccccttt 1560 

cttcttttgc caagaaagtc caatattttc accattttta tgcacacaat caactttatt 1620 

taagctggaa gttaatgtct cattgttttc attgttctaa ataaacacct tttcccttga 1680 

gtattgctct aaaaaaaaaa aaaaaaaaaa a 1711 



<210> 51 
<211> 749 
<212> DNA 

<213> Homo sapiens 



<400> 51 

gccaaaccag rtaataattt ccttataata catgaagtcg ttattttgca tttattttct 60 

taggtggcca atggggttat cttgggggga gacttttata ctcctaaggg acagcttggc 120 

cattaacttt caaagtttct ctaaagcagc gtcaggagat atatttggtt gtcatgacta 180 

gtggcattcc actgacatgt aatgggtaga ggctgggtag acatcctacg atgcacaaga 240 

cagcctccca caataaagaa ctgtgtggcc caaaaatatc agtgatgctg agattgagaa 300 

acttaaagaa atttaaaaat taactctata caaaatctaa tgtttgagtt ttctccatgt 360 

atctgtgact gcaatgacca gagtgactgt ccataaagaa agtgctaaga gttggctggg 420 

tgcggtggcc tacacctgta atcccagcac tttgggaggc caaggtgggt ggatcacctg 480 

aggtcaggag ttcgagacca gcctggccaa catggcaaaa ccccatctct actaaaaaat 540 

acaaaaatta gctgggtgtg gtggcacgca cctgtagtca cagctactca ggaggttgag 600 

gcaggagagg tgcttgaacc cgggagatgg aggttgcagt gagccgagat tatgccattg 660 

cactccagcc tgggtgacag agtgagacaa aaaaaaaaaa aaaaaactcg agggggggcc 720 

cggtacccaa ttcgccctat aggcagtcc 749 



<210> 52 

<211> 1091 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (1079) 

<223> n equals a,t,g, or c 



<400> 52 

ggccagtggg cagggtcaca gggcaaggtc ccgcgggccg ctgggtgcgg cgacttccgt 60 

gctcccggcg agcgggcgga gagcgggggc cgcactgggg agtgtgggct gggccgcaga 12 0 

tgtcatgtgg cctgtktttt ggaccgtggt tcgtacctat gctccttatg tcacattccc 180 

tgttgccttc gtggtcgggg ctgtgggtta ccacctggaa tggttcatca ggggaaagga 240 

cccccagccc gtggaggagg aaaagagcat ctcagagcgc cgggaggatc gcaagctgga 300 
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tgagcttcta ggcaaggacc acacgcaggt ggtgagcctt aaggacaagc tagaatttgc 3 60 

cccgaaagct gtgctgaaca gaaaccgccc agagaagaat taatggagga cacagggccc 420 
tatggtccta ctgtgggtgg tgacttgtcc tgctaccatg ttgacagagc cccagaaccc 480 
acatctaatt ggctttgttg cttattctgg cccttcccac accacacagc cacacaaata 540 

ctggctgctc cttgatggcc aggcagaccc agcagcagcc gaggggccag tgaagaggaa 600 

ggccgcatct gttgtgtggt ggccacaagc actcaggcat ctgagtttac tggtgcactg 660 

ctgggaggag agttatgaga tgaacattgg ctgtcaatct ctgtgggcag gcggtttggc 720 

ctctagtggg aatggctggg atttgggcgt tgcctttagg agggatacct gcatgtctag 7 80 

ttccagtctg cactggaaag aattcaaata tgcacctggc tcccttcact attttgccct 840 

atcctttgtg ctcattctta ctgaaatctg tcttgtcagc tcaggaatgg gattccccca 900 

ggaaggaaag cacttttctg ttctgggaag cccagactgt tcactttggg gcagggacga 9 60 

acatgtgcct cgtgaatttg cttgaaaaca gtcaccatct tctaccccca tcactgtata 1020 

gtgaaaaacc tgattaaagt ggtatctgag aaccawaaaa aaaaaaaaaa aaaactcgng 1080 

ggggggcccg g 1091 



<210> 53 

<211> 2254 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (1182) 

<223> n equals a,t,g, or c 



<400> 53 

ggcacgaggc ccgctgcaat gttatcatca cccaacctcg ccgcatctct gctgtgtctg 60 

tggcacagcg ggtcagccac gaactgggcc cctccctgcg ccggaatgtg ggcttccagg 120 

tgcggttgga aagtaagccc ccatcccgag gcggggccct gctcttctgc actgtgggta 180 

tcctgctgcg taastgcaga gcaaccccag cctggagggc gtgagccacg tcatcgtgga 240 

tgaggtgcat gagcgggacg tgaacacaga ctttctgctg atcctgctca agggcctgca 3 00 

gcggytcaac ccggccctgc ggctggtgct catgaktgcc acaggggaca atgagcgctt 3 60 

ctcccgatac tttggtggct gccccgtcat caaggtgcct ggcttcatgt acccagtcaa 42 0 

ggagcactac ctagaggaca tcctggccaa gttgggcaag caccagtacc tgcaccggca 480 

ccggcaccat gagtctgagg atgaatgcgc actcgatttg gaccttgtga ctgatctggt 540 

tctgcacatc gatgctcgcg gggaaccagg tgggatcctg tgcttcctgc ctgggtggca 600 

gagatcaaag gagtgcagca gcgcctccag gaggccctgg gcatgcacga gagcaagtac 660 

ctcatcctgc cagtgcactc caacatcccc atgatggatc agaaggccat attccagcag 720 

cctccagttg gggtgcgcaa gattgtcttg gccaccaaca ttgctgagac ttccatcaca 780 

atcaatgaca tcgtgcatgt ggtggacagt gggctgcaca aggaagaacg ctatgacctg 840 

aagaccaagg tgtcctgcct ggagacagtg tgggtatcaa gagccaatgt gatccagcgc 900 

cggggccggg cgggccgctg ccagtccggc tttgcctacc acttgttccc tcgaagccgg 960 

ctggagaaaa tggtcccttt ccaagtgcca gagatcctgc gcacacctct tgagaacctg 1020 

gtgctgcaag cgaaaatcca catgcctgag aagacggcgg tggagttcct gtccaaggct 1080 

gtggacagtc caaacatcaa ggcagtggac gaggctgtga tcttgctcca ggagatcggg 1140 

gtgctggacc agcgggagta cctgactacc ctggggcagc gnctggctca catctccacc 1200 

gacccccggt tggccaaggc cattgtgttg gctgccatct tccgttgcct gcacccacta 1260 

ctggtggtcg tttcctgcct cacccgggac cccttcagca gcagcctaca gaaccgggca 1320 

gaggtggaca aggtgaaagc actgttgagc catgacagcg gcagtgacca cctggccttt 1380 

gtgcgggctg tcgccggctg ggaggaggtg ctgcgttggc aggaccgcag ctcccgggag 1440 

aattacctgg aggaaaacct gctgtacgca cccagcctgc gcttcatcca cggactcatc 1500 

aagcagttct cagagaacat ttatgaggcc ttcctggtgg ggaagccctc ggactgcacc 1560 

ctggcctccg cccagtgcaa cgagtacagt gaggaggagg agctggtgaa gggcgtgctg 1620 

atggccggcc tctaccccaa cctcatccag gtgaggcagg gcaaggtcac ccggcagggg 1680 
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aagttcaagc ccaacagcgt cacatatagg accaaatcag gcaacatcct gctgcacaag 1740 

tcgaccatta acagggaggc cacacggtta cggagccgat ggctgacgta tttcatggca 1800 

gtcaagtcca atggcagcgt cttcgtccgg gactcctctc aggtgcaccc gctagctgtg 1860 

ctgctgctga ccgacgggga cgtgcacatc cgtgatgacg ggcgccgggc caccatctca 192 0 

ctgagcgaca gtgacctgct gcggctggag ggtgactcgc gtaccgtgcg gctgctgaag 1980 

gagctgcggc gggccctggg ccgcatggtg gagcggagcc tgcgcagcga gctggctgca 2040 

cttcccccca gcgtacagga ggagcacggg cagctgcttg cgctactggc agagctgctg 2100 

cgaggaccct gtggcagctt tgatgtgcgc aagacagctg acgactgagc cctgcttctg 2160 

ctggggctgt gtacagagtg caaatgttta tttaaaataa agttctattt atcccttgtg 2220 

aaaaaaaaaa aaaaaaaaaa aaaaaaaact cgag 2254 



<210> 54 
<211> 486 
<212> DNA 

<213> Homo sapiens 



<400> 54 

cacactgaca tctccccaac aggtgagggc agggagagct ccagacaggg agaggccttc 60 

agagaacagg aaggaagctc cctccctcct ctgcattttg cagcctgtag ctcacgtgcc 120 

ttttatgccc cacatctcat tctgtctggg gactccatac gtagtggctg tctaccttcc 180 

cgcgtggatt gtaatgcttt tgctaccagg ggtcaggcca tactcatcac tgcaggccct 240 

gaagcatcca tcatgttcct cgagctcagt atgtgctccg tacatgtagc acagtggaaa 300 

aacttgagct ttgctggcaa agacagacag aatgagcttg aatctcagcc cagctatggc 3 60 

ttttctagtc ctgtggctag aaaatgactt agcctcttgg actttggtta acccatctgc 420 

aaaacaggga tggcacccac ctcttagaaa gttacagtgg tcaaaaaaaa aaaaaaaaaa 480 

ctcgag 486 



<210> 55 

<211> 1270 

<212> DNA 

<213> Homo sapiens 



<400> 55 

gaaaccatcc aagataagag acatgggagt gaaattcaca cccactctgg ctttcatacc 60 

atgggtctga acattagccc atggtgtttc ttggccatac tgacctgtgc catttcagct 12 0 

gcattcatct cagttggtgt tgtctgctgg ctgctctttc tgatttccca caggagcagt 180 

aagaacctga ggaagagtag ggtcagagga gtctgggaga atgaggaaat atgagagccc 240 

caggaactga aaaggcctgt gagagactct gagcttcctg ggaacaggta taggttcttt 3 00 

ttatttcaat aataacagaa acaactgtca aaaccatgtg cctgtactat ttggagtgct 3 60 

gtccttgcag aatctcatta taagaacctt aggaaatagg cacatcatct cctggataga 42 0 

atcctaggaa atgggcacta taatgggcac tttatcccat tttataaaca tggaaattga 480 

ggcacagaga gattaagtac tttcccaagg tcatacagct agtgatggag gagctagcat 540 

ttgaacccsg gagtttttag tctattgagt ttaaccgaca gatcatactg tgttttggta 600 

gggaggragg gtgaagcaag caartgaaca aatgartctg ggatttarga cttgccagac 66.0 

aaacaaggcc caagaggcaa gtgtgcaggt gggtgtagtt gggagtcagc agagttgggt 72 0 

tggaattcaa gctttgccac ctgctggcta taaaccttgg ttgggtaagt aacccaaggt 780 

aaatgagatc atctctgtaa aactcttagc cttgtgcctg gcacatagta aatgcttaat 840 

aagggttcac tgttagtatt actgttactg ataacataca aatagattgt attaatggac 900 

cataattgca actgtataaa acaaattcca tgtttggcca ggcgcagtgg ctcaagcctg 960 

taatcccatc acttcgggag gccgaggtgg gcagatcacg aggtcaggag atcaagacca 102 0 

tcctggctaa cacagtgaaa ccccatctct actaaaaata caaaaaaatt agccaggcat 1080 

ggtggcgggt gcctgtattc ccagctactt gggaggctga ggcaggagaa tggcgcgaac 1140 

ccagggggcg gagcttgcag tgagctgtaa ttgtgccact gcactccagc ctgggcgaca 1200 
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gagcgagact ccgtcycaaa aaaaaaaaaa aaaaaactcg agggggggcc cgaacccaat 1260 
cgccctatag 1270 



<210> 56 
<211> 2059 
<212> DNA 

<213> Homo sapiens 



<400> 56 

ggcacgagcc 

tgcacagaga 

aaaaagtttt 

aagcctttgg 

agtggtaaaa 

ggtctagggg 

aggagcactg 

ctcttagggc 

ccccgcctgt 

ctgtcaaagc 

agacttttct 

gcaggaaatt 

aagatatgaa 

ccatgcagat 

tttgggagga 

tccacatctg 

tggcgtcaag 

ttaataggaa 

caagttggct 

gcagagcacc 

gcgggtgccc 

gaatcatgag 

tcgggactag 

cctaatccga 

cagtgctcag 

cctgcgtctg 

tttgaggctg 

ataggaatgt 

ctctgcagcg 

tatagtgtct 

gcctgtgatc 

gaggttgcag 

tttgtctcaa 

gcccagaagt 

taaaaaaaaa 



tcactgggta 

agcactaagt 

agggaacatt 

gcacttttca 

atacctcttt 

cctccaggtg 

ggagggacag 

cctgccactt 

gcggctggat 

ggaaaacagt 

ccttcatgac 

ttgtttcctg 

ccgagagccg 

aacacagcag 

accgctgggc 

ggctgagttc 

agaagtaaag 

agattttttt 

ctgctgaagt 

ttcctgctga 

aagagggctg 

aggttctcag 

ttgtgtttag 

acttcagaaa 

atatggtcag 

acagaagctc 

caccgtttct 

gctcgggcag 

tctttcgggt 

tggggtcctg 

ccagctactc 

tgagcggaga 

aaaaaaagga 

tctgatggag 

aaaaaaaaa 



aacacaagct 

caaagtcatc 

aactgttaat 

aacctaacaa 

gttatctaag 

tacggctttg 

tggaggaaga 

actttgctca 

ctgggctgcg 

tcacacacca 

cctcatctct 

cttgtcttca 

tggcagctgt 

ggtccaggcc 

aggttctgcc 

ccaaagaaag 

tgttcagatg 

ttttatccgt 

agaaatggtg 

ccacagctgt 

agggctgcgt 

ggctgcctcc 

gttttcttaa 

tccaaaatgc 

accctggagc 

cagagaagtg 

ggaagtcaag 

ggaacccgga 

cctgtgggtc 

caggcttggc 

aggaggctga 

ttgcgccatt 

acgtgcctca 

cttctgtcag 



ctcggcaggg 

tccgtgtggt 

ggcttatgta 

acaactgatt 

acaactttta 

aagtaagggc 

ggccccgccc 

gagcaaaagg 

tcagcaccgg 

gagcctcatg 

gctctcgggc 

aagacttcag 

ccagcttgca 

cttccctctg 

caggaacagt 

tgcgcctaac 

gcagtgattt 

ttggttttta 

ggcggggagt 

gagcgccggc 

ctgccatggt 

cacaggcttt 

aattctgtag 

tccagtgagt 

actttgcatt 

gctgcgagtt 

ccccacagtg 

gcaccagccg 

tggctgtgcc 

tgtacccggc 

ggcaggagaa 

tcactccagc 

ttcagtggtt 

acacaggctg 



aacaagctct 
cattaaaatt 
ttagctgttc 
gaatttctat 

ggtggcatta 

aagaagcatg 

gggtgccact 

tgccgtcagc 

gacccgcccc 

tggaaagagt 

ctcagcccgc 

ggcttagaac 

ggctgatatt 

gaactcacac 

taactctgca 

aacgtcttgt 

taatgaacct 

tttttagaat 

cgccactgac 

tgtacgcagg 

gtctccacct 

ctgtgtctta 

taattgcatt 

atttcatttg 

ctgaagtgaa 

cagggcaaga 

gacctcgagt 

ctgggcccct 

cagccctgct 

cccagtgcat 

tcacttgaac 

ctgggcagca 

cgatggtggt 

agtatccttc 



cggcagggaa 

ccagtgaatg 

tctgttttaa 

tgatggtcaa 

agaccccgag 

ggtgccctct 

gctgaggcag 

tgggtcctac 

agcagcacat 

ccaggcgctc 

ctgtcgtact 

ggaccataga 

tgtgtagatg 

tcggctgtat 

gagcacagtt 

ggaaggcggt 

actagctatt 

catgaaatag 

catcgtctgc 

tccctgctgt 

tggacaccat 

cctgggacac 

gtagagcatc 

agcatcatgt 

ggatgctcag 

ggctcctggc 

ccctctgtga 

cgttccccgg 

gcccgtggac 

ggtggcaagt 

ctgggaggca 

agagcgagac 

tctgatcaat 

accccaaaat 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2059 



<210> 57 
<211> 868 
<212> DNA 

<213> Homo sapiens 
<400> 57 

gactgactat agggaaagct ggtacgcctg caggtaccgg tccggaattc cgggtcgacc 60 
cacgcgtccg ctgaatttag gagacttttt acccaggggc aaaaggctct tagggtaatg 120 
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agatggatgg tggcccaggt gcattttcca gggcctgggt tctccagatc ccgtggcttc 180 

tgttgagtgg aggcaacttt gctctgtgtg aacctcgccc ctgtccctct gccgggcacc 240 

cctggcagga agcaggactc ccatcctcac cctgacttag actgtcctct gagtcagctc 300 

ctctccaaga caggagtggg cagccctggg cagtcttctg gccccttgct aaagtgaggg 360 

scaggaagct ggggctgccc tccagaaagc cggggtaggr actctgaaaa atacctcctc 42 0 

taaacggaag cagggytctc cagttccact tggcgccccc tcccacaagg cccttcctcc 480 

ctgaggaccc caccccccta ccccttcccc agcagccttt ggaccctcac ctctctccgg 540 

tgtccgtggg tcctcagccc agggtgagct gcagtcaggc gggatgggac gggcaggcca 600 

gaggtcagcc agctcctagc agagaagagc cagccagacc ccaaccctgt ctcttgtcca 660 

tgccctttgt gatttcagtc ttggtagact tgtatttgga gttttgtgct tcaaagtttt 720 

tgtttttgtt tgtttggttt ttgttttgag ggggtggggg gggatacaga gcagctgatc 780 

aatttgtatt tatttatttt aacattttac taaataaagc caaataaagc ctcaaaaaaa 840 

aaaaaaaaaa aaaaaaaagg gcggccgc 868 



<210> 58 

<211> 986 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (592) 

<223> n equals a,t,g, or c 



<220> 

<221> SITE 
<222> (669) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (767) 

<223> n equals a,t,g, or c 



<400> 58 

gaaattaagt catttagata aaaatatgcc attcttatct gtttggtttt ttaatcttgg 60 

cttaatattt ggggttgagt catttgtttt gagagctgtc ctgtttattg cagggtgttc 12 0 

agcaacatcc cagatggaag cagcatcccc ctacccagct gtgacaaaaa gaaaaaaaaa 180 

tgtctccaka cactgccaaa tatcttctgg gggtgcccct ggttgagaac cactgcttta 240 

gtggataaac tttaggcagg agggaaatga tcgcagttgg atagttggag gaatgtggag 300 

caagggaagc aataaactgt gaccataaaa acatagaaag atggcttata tgtggatttt 360 

tttttaaagc acgtagaatt gcttaaaatg gacaacagca gcatataaat cagtggcaga 42 0 

gttggtggct gaatttagag catcttaagt ctatgttctc ctggaacaga gtgcagataa 480 

ttcagttatc agcttggcta ggtgcatgtt gaagtattta gtcacacaca aacagttaat 540 

gtatggggaa gataacttct atactagtag gagagaaatg gaacaagaat wntaaataca 600 

ctatcaaaat atgcaagaat ggcaagmgga aaaggcagaa caagctgcaa aacmcacaca 660 

caattagana taaatatttt gggacacaat aaatgtgaat ggattaaaac ctctgttaat 720 

gacaaagttc tccagttaaa ggaaggcaaa tagtgttatt aggaatngat tactatatga 780 

tgattaaagg ctcagttcaa caggaagatg attgatagaa ctttcctaca tttgtaacac 840 

agtcttagaa gatattaaag caaacattca agaagaaatt gatcacctac taccatagtg 900 

tattttattg aattggtaca tttcaataaa gtgtcataag gcacggttga aggaaaaaaa 960 

aaaaaaaaaa aaaaaagggc ggccgc 986 
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<210> 59 

<211> 695 

<212> DNA 

< 2 1 3 > Homo s ap i ens 



<400> 59 

ttttttttct tgaaataaaa tgggggagta atgggaaata atttttttga gcccttgcgt 60 

ttctaaaaat gtttgcattg tgccttcatg tttgacagtt cagttccagg ttgaaaatta 120 

tttttctttg gaatgttaac agctgccctc tattttctgt ttttatctaa tgttgctgaa 180 

gagaaatctt ctgattctta ttcttttttt ggtgacctgt tttaattttg tgtctttctt 240 

ttttttcccc tggaagcttt taggctctcc cttttatcct tgtagtctga gatctgacaa 300 

tgatggttgt gtctagtttt ccccaggata ctttttcatt tgtcctggtc agcactagta 360 

gatcctttca gtttgtgtac ttctgtttct cttgctctgg agaatttaaa aaatatatat 420 

atttttgaga cagagtctca ctctgtcacc caggttagag ggcagtggtg tagtctctac 480 

tcactgcaac ttctgcctcc tgtgtttaag cgattctcct gtctcagcca cctgagtagc 540 

tgggattaca ggtgcctgcc accacgccca gctaattttt ttgtattttt agtagaggca 600 

gggtttcacc acgttgccca ggctggtgtc gaactcctga cctcagatgg tccacctgcc 660 

ttggtctccc aaaaaaaaaa aaaaagggcg gccgc 695 



<210> 60 

<211> 314 

<212> DNA 

<213> Homo sapiens 



<400> 60 

gtcgacccac gcgtccgctt tgaggagcat tcctctagat tgcacaaggg acagtgcctt 60 

taaccaagcg aggagtccaa agctcaggac ctgactaccc tgagggcacg ctgacgcctc 12 0 

ttcccagggg gatggggagc tttctgcacc cccagtggca tctcctcatc acgttctgtg 180 

ccgtccttgg gaaaggcctg cattctgatc cttccaggcc cttcgagcat ggaggggcac 240 

tggggaaggt cccccgaggg aggagcacgt tgctgagtaa agaggtgtta ctcaaaaaaa 300 

aaaaaaaaaa aagg 3^4 



<210> 61 

<211> 734 

<212> DNA 

<213> Homo sapiens 



<400> 61 

gactgcttat atttggcatt gtcttttccc tggcactgcc actgtcacca ccatccccct 60 

tctggatccc tactttaccc cttcatgctg ctctggtggc agtgcctctg ctgccatgct 120 

gtacttgagc ctgctgctac agccatgcct gaagatgcag ccccttcctc tcttcctgtc 180 

ccaccaaata tgaccagctc taggttccat tacttctgga ctttgctcca aataaaactt 240 

acacaatttt attccaaacc caggtctctt tctgcaacac ccgagaaaaa tattgggctg 300 

caggagccag agaggagaga gagatttact ggtgagagtt gtaggtggga attgaaagcc 360 

aagtcatgtc tttgccccac cagaaactca ctaggatgta cacaatgcca ctgtgatggt 420 

kttaaaatat gtaactaacc tgcacgttgt gcacatgtac cctaaaactt caagtatata 480 

taaararaga aagaactgst gatacacata tcatgaaaaa agaccaaata aaataaaaaa 540 

ataaaaataa ataaataaaa taaaatatgt ccacaaatgc tttgatgttc ctttgtttct 600 

tgatctgtat gctagcaaca caggttcatt ccgtttgtga aaattcattg agctgtgctc 660 

ttatgagctg tgtacttctc tacatgtatg ttaaatgtgg acaagaactt cacataaaaa 720 

tcattttaaa aaaa nr>A 
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<210> 62 

<211> 1410 

<212> DNA 

<213> Homo sapiens 



<400> 62 

ccgcctcctt gccgcccagc cggtccaggc ctctggcgaa catggcgctt gtcccctgcc 60 

aggtgctgcg gatggcaatc ctgctgtctt actgctctat cctgtgtaac tacaaggcca 12 0 

tcgaaatgcc ctcacaccag acctacggag ggagctggaa attcctgacg ttcattgatc 180 

tggttatcca ggctgtcttt tttggcatct gtgtgctgac tgatctttcc agtcttctga 240 

ctcgaggaag tgggaaccag gagcaagaga ggcagctcaa gaagctcatc tctctccggg 300 

actggatgtt agctgtgttg gcctttcctg ttggggtttt tgttgtagca gtgttctgga 360 

tcatttatgc ctatgacaga gagatgatat acccgaagct gctggataat tttatcccag 42 0 

ggtggctgaa tcacggaatg cacacgacgg ttctgccctt tatattaatc gagatgagga 480 

catcgcacca tcagtatccc agcaggagca gcggacttac cgccatatgt accttctctg 540 

ttggctatat attatgggtg tgctgggtgc atcatgtaac tggcatgtgg gtgtaccctt 600 

tcctggaaca cattggccca ggagccagaa tcatcttctt tgggtctaca accatcttaa 660 

tgaacttcct gtacctgctg ggagaagttc tgaacaacta tatctgggat acacagaaaa 72 0 

gtatggaaga agagaaagaa aagcctaaat tggaatgaga tccaagtcta aacgcaagag 7 80 

ctagattgag ccgccattga agactccttc ccctcgggca ttggcagtgg gggagaaaag 840 

gcttcaaagg aacttggtgg catcagcacc cccctccccc aatgaggaca ccttttatat 900 

ataaatatgt ataaacatag aatacagttg tttccaaaag aactcaccct cactgtgtgt 960 

taaagaattc ttcccaaagt cattactgat aataacattt tttccttttc tagttttaaa 1020 

accagaattg gaccttggat ttttattttg gcaattgtaa ctccatctaa tcaagaaaga 1080 

ataaaagttt attgcacttc tttttgagaa mtatgttaaa gtcaaagggg catatataga 1140 

gtaaggcttt tgtgtattta atcctaaagg tggctgtaat catgaaccta ggccaccatg 1200 

gggacctgag agggaagggg acagatgttt ctcattgcat aatgtcacag ttgcctcaaa 12 60 

tgagcaccat ttgtaataat gatgtcaatt tcatgaaaag cctgagtgta ttgcatctct 1320 

tgatttaatc atgtgaaact tttcctagat gcaaatgctg actaataaag acaaagccac 1380 

cctgaaaaaa aaaaaaaaaa gggcggccgc 1410 



<210> 63 
<211> 1231 
<212> DNA 

<213> Homo sapiens 



<400> 63 

ggcacgagtg aatgtcgagg agttccagga tctctggcct cagttgtcct tggttattga 60 

tgggggacaa attggggatg gccagagccc cgagtgtcgc cttggctcaa ctgtggttga 120 

tttgtctgtg cccggaaagt ttggcatcat tcgtccaggc tgtgccctgg aaagtactac 180 

agccatcctc caacagaagt acggactgct cccctcacat gcgtcctacc tgtgaaactc 240 

tgggaagcag gaaggcccaa gacctggtgc tggatactat gtgtctgtcc actgacgact 3 00 

gtcaaggcct catttgcaga ggccaccgga gctagggcac tagcctgact tttaaggcag 3 60 

tgtgtctttc tgagcactgt agaccaagcc cttggagctg ctggtttagc cttgcacctg 420 

gggaaaggat gtatttattt gtattttcat atatcagcca aaagctgaat ggaaaagtta 480 

agaacattcc taggtggcct tattctaata agtttcttct gtctgttttg tttttcaatt 540 

gaaaagtaat taaataacag attagaatct agtgagagcc tcctctctgg tgggtggtgg 600 

catttaaggt caaaccagcc agaagtgctg gtgctgttta aaaagtctca ggtggctgcg 660 

tgtggtggct catgcctgta atcccaacat tctgggaggc ccaggcggga gaactgcttg 720 

agccccagga gttcagaatc agcctgggca acatagcaat actccgtctc ataaaaatta 780 

ataaataaaa agtctcaggt gaccaaaggc tcctgaagct agaaccaggt ttggataaag 840 

attgaagagc cacaggccac tcttccctct gagccattgg gcctagtggt gtcatgtatt 900 

gtaattgctc gcagggagag cagtcttttt ggtgtaatag tgggatgtct gcttagttgg 960 

caggggttca gtccaaatgg aagaatattg ggaaataaac ctccactatc ctttatagcc 1020 
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agggactttt ttcctattta ttcataaaat aaattatagt taattatacc cataacacct 1080 

ttatttaaat ccagtgttct ccgcagcctt ttgtctattt atatgtgtac caagtgttaa 1140 

acataattat tattgggcat ttgaactttg tttttcttta aagaaatgct gctattaaac 1200 

atatttgtaa atggaaaaaa aaaaaaaaaa a 1231 



<210> 64 

<211> 612 

<212> DNA 

<213> Homo sapiens 



<400> 64 

ggtcgaccca cgcgtccgag catttgtctg tataatttta gttattgaat taaaatcttt 60 

tgggacccca acaggatgag atcattggcc agctggcttc ctcccacctg cacctggact 120 

gaaattcccc gtggcattag aggtgtttcg taaggtgctc cctgctgtct gtcctacaga 180 

ttgcagtggc tctgctggaa aagaacggaa ttctatgcaa gttgcgtgtg tcatgaaggt 240 

ctctgcacag tgggtgtgtt tctttgtcgt cttttctcca ctctgctctt ctgtgaaatg 300 

tgccagcagt ggacagaaca ggggcagagg tgatcagtga ccattgcaca gaatatcagt 3 60 

aagtgttgta aggtatatag tcttggccaa caaattgtaa gcaaaatacc aggaacttcc 420 

taatctagta ggaaattttg tatgcttttg acaaacatct gatcctactg acactgaaag 480 

tccttagaag gagaattgct tgaacccgga aggtggcggt tgcagtgagc caagatggcg 54 0 

ctactgcact ccagcctggg caataggaat gaaactccgt caccaaaaaa aaaaaaaaaa 600 

aagggcggcc gc 612 



<210> 65 
<211> 2270 
<212> DNA 

<213> Homo sapiens 



<400> 65 

tttttttttt aactttttaa acaatccatt ttaatcatct aaattattta caatacaata 60 

acatggattc atccttttta agacatggga ttgtaaaaat caacaagtga atgatgcttc 120 

aaataataca tttaaataca ttaatcaaat tttttcagtg cttaaaactt tttctccatg 180 

ggacagcagg ctctggacaa aagtgcctag catacaagtt ttcccaattt ccttctatca 240 

taccagctgc acataaaaag gttcatcacc tcctgtctcc aaagtgtctc cctactgagt 300 

gttcccaggc agacaatagt tcctgggata gtgctgtttg gtaacagaaa agcccaagcg 3 60 

tagaggacgg attaaaaggc agggaccaga ccgccatgga tacaaatccc aagacagagg 420 

atgccccatg ccttccccat gaagcttatc tgtctgcctg tgtctccatg attgcaggca 480 

tagagctact tgggacctcc aggatgattt acttagcgat atgcttttta cattctaaga 540 

atcaaaatgg tcctgtaatt cccaatagag aaaatagagc caattcattg ttctcccctc 600 

tcccctctga agccagtttt taaagatgag ccttacccag aaaataagcc ccaaagaact 660 

ctcatctaaa tgatcagacc cttcctaaat tacctttggc aacctaggta attctttttt 720 

attacacacc tccaacctga ccctttctac agtttcaact ataaatgttc atgcccctcr 780 

tcaaataacg ttgctaggat gaatttgcca caggtttgag tacagagaga acaagcaaga 840 

aaaatgtcag tgtttatttt aaggagagtg gccaggatgt cagtcctcat aattggtccc 900 

ttctctctct ctatcctcca aggtaagttc tttgttgact tgataagctt tagtccttct 960 

gtacaacttc tagaagatgc acttaatggt gcttctttgc acttccagaa ctcaccttct 1020 

attctacctg taaggctgta ggggagcatc ccaatcaaca taaggcctac ccctttagcc 1080 

acgaaaatca gccaggcatc atgtttctgc accaccacct gccttcctga cggacactgg 1140 

tgctgatgac aaaaatggga cagtaccgca gctggtttct ctttttcgag tgtgtagata 1200 

agaaataaaa aacattttca ttccctcaca agcttaatct agtaatataa ctgcctaaaa 1260 

aaaatcaaac cataaataaa cctatgtgct aaacaaatca catgacttga tgacttctct 1320 

aaaattaatg tcaaggaaaa aaggaaaagt tgatcccaag taaaatccct tgaccacagc 1380 

tgtctgaaat tagccagggg aatgggagac accaccaaga acctcagctc tttcctgccc 1440 
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tgtatttcaa 
cgtcctcact 
aattcatttt 
gggctgggag 
ccatgattcc 
taattctgta 
gagctctttt 
aaggtttatt 
tgtgtaagct 
ctgacgaaat 
aacagatgat 
gctccacttg 
cagaccagga 
tttttttttt 



ggggagtgtt 
tctaacctgg 
ttctcagtct 
gtccagtcta 
cagagctctg 
aactcatggc 
ccaacttcat 
tggttaagga 
gaaaagaaaa 
gtctgaaatc 
ctcatttacc 
gaaatcacca 
agaagggcac 
ggaattcgat 



gtggccttca caaatgaaaa ttatgaatca caaagataaa 
tgaatcctca ggaatgtcat gaggatgaca acacagggtt 
cccccctgac tccacaaaag ctttgccttc ccaacacaag 
gacagagcat gctgttgggg taaacagtaa ccatgtgatc 
agcacaaagc ttttcatccc agtggcaact ggaatgtggg 
cacaccttta atgcttgggg acagtgggtg gagtcagcca 
ctagggtctt ctctctggaa aagcttagtg acgttctccg 
gtattgctaa aacacttttt aaaaatccac tttgaacaca 
tgacatatat acctccattg aagctgggaa agtgaaaagg 
ctgagccttt cctggttcta ttttaataca gcgtacaggt 
ttctgaatga cccagcactc aatttcccta aaactgctca 
ggggacttga gaatcttccc cttagactca gggagacacc 
tgatgttttc agggacccaa aagcccactt tttttttttt 
atcaagctta tcgataccgt cgacctcgag 



1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2270 



<210> 66 

<211> 1283 

<212> DNA 

<213> Homo sapiens 

<400> 66 

ggcacgagcg agggaacaga ttcccagggt ggggtggggt agggctgggc ctttgctcct 60 

ttgtctcctt tcccaaggca aagtgaagag aagagagtga ctccttcttc actcagggaa 120 

cccaggcagg gatgaagcgc ctgtggtgtc tgagctgggt cccggggctg caggggagcc 180 

cctcagtgtt gtcctctgta ttcttctccg tgttcaaacc acagctgcat tggacatgca 240 

gtcaggtgtc ctctcactgg caccctccct gccttttcat tcttttttct ggatagtctt 300 

tcatcaagtt ttctctgcct tcaccttgct cttcctgaat cagttcacct tgaggggggt 360 

taacagagca ccttggcagg ctctgttcct ccaggtccca ggccagcccc cgggactcag 420 

ggcctgcctt cccctcacct tcttgagcag cacaaactcg ttctctgctg ctgtccgctt 480 

gttgatttcc tcctcatacc tgtgggaatg gcgagggctc attggttaat atctcactga 540 

aagcccttgc tttcacaggg cagcgttgag caaggagcag cgtgtccatc agaagataca 600 

ggtgctgggg gccgcagagt actgggcagg ggtaagtggg ggaaggcttc ctggaggagg 660 

gaacatgcta accagtttgg aagaatgaga ctgttaaaga tccagcttgg caaacgagga 72 0 

aggagcacat ggagcgaatg ccctgtggga ctctcagagg aaccaggatg tgaactgccc 780 

tccccaaatt tgagtacagc tttacaaatg acaaagcgct ccaatctgca tttcctcagt 840 

tacccttgag agcagtcttg gagccagatg cacttaaccc tcctcttaca tgggagagaa 900 

cgtgagtggt ttccccaaac attccctaaa cccagagcca gagataaccc tctgcccact 960 

gcccagctca ctgggcattt gtcctaagag tcaggccaga ggctggagga gcagagagca 1020 

agttccagag ttttgttggg gtgactctgc ttgatatgac caagaacaat gccctccact 1080 

gacctccaaa gcatttaagc tggggtgact gccaggggtc ccttggaggg acaagggcag 1140 

ttgtccagtt acagggggac tcctcctgct cacctcttct tgtagtcctc cactacgtcc 1200 

cgcacattcc tcagctccga gtccagcctc accctgtccc cagacagcgt ctccagctgc 12 60 

ttccgcaggt tgctgatgta gcc 12 83 



<210> 67 

<211> 1263 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (1256) 

<223> n equals a,t,g, or c 
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<400> 67 

gaggagatcg ccacctccat cgaacccatc cgcgacttcc tggccatcgt tttcttcgcc 60 

tccatagggc tccacgtgtt ccccacgttt gtggcgtacg agctcacggt gctggtgttc 120 

ctcaccttgt cagtggtggt gatgaagttt ctcctggcgg cgctggtcct gtctctcatt 180 

ctgccgagga gcagccagta catcaagtgg atcgtctctg cggggcttgc ccaggtcagc 240 

gagttttcct ttgtcctggg gagccgggcg cgaagagcgg gcgtcatctc tcgggaggtg 300 

tacctcctta tactgagtgt gaccacgctc agcctcttgc tcgccccggt gctgtggaga 3 60 

gctgcaatca cgaggtgtgt gcccagaccg gagagacggt ccagcctctg atggctcgga 420 

gatgatggac cgtggaaggg aagcgtctgt ggggagtgag cgcttagatg gccagcagct 480 

gctccttctg ggaagctcgc accttggcaa cagaacagcc ctctagcaga gcgtcagtgc 540 

agtcgtgtta tcccggcttt tacagaatat tcttgtccta ttttagaatt ttccggagta 600 

gtttatttgc agtctgttga ttatgtgcag tagacccggg acactgcgtt ttaccgatca 660 

ccttgaatgt ggtgcctgga tgtgcctttt ttttttttcc ctgaaattat tattaatttt 720 

ctattgtgag ttcatcagtt catagttttt ttagtaaaga agcaaaatta aaaggctttt 780 

aaaaatgtac aacttcagaa ttataatctg ttagtcaaat atttgttatt aaacatttct 840 

gtaatatgaa gttgtaatcc tggccgtgag cttggaagct tacttttgat tcttaaagcc 900 

tatgttttct aaaatgagac aaatacggat gtctatttgc cttttattgt aacttttaaa 9 60 

tgaaataatt tcatgtcaat ttctattaga tatatcactt aaaatatttg gttttaaatc 1020 

acaagaatat gtattcttta ataaagataa tttatgatca tggtataatt aattgaaatt 1080 

tattaaaatc tgtttttatt aaaaaaaaaa aaaaaaaaac tcgagggggg gcccggtacc 1140 

caattcgccc tatagtgagt cgtattacaa ttcactggcc gtcgttttac aacgtcgtga 1200 

ctgggaaaac cctggcgtta cccaacttaa tcgcctttgc agcacatccc ctttgncagc 1260 

tfc g 1263 



<210> 68 

<211> 1617 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (1578) 

<223> n equals a,t,g, or c 



<220> 

<221> SITE 
<222> (1586) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (1605) 

<223> n equals a,t,g, or c 



<400> 68 

tcgacccacg cgtccgggaa acctgatact gcaacctgca atgtaggatg tttgtatggc 60 

atttaaaggt aatggtgatg tttattattc tatactttgc atactgtgag agtaattttc 120 

actctgtctt aagtgtgagt aagcctcttc taaaaatctt gttcttgcca agaaatttat 180 

aaatcacata cgaagacgtc tgttgctaac agttaacttt atgaggtaac tatatccttc 240 

tatttctctg gactcatttt taaaaaatat gccgaatact gcatactgtt taaggtagta 300 

tataagttta tgagagaagt ggagagcttt cttccttgaa aagtcggtat ttgttgagat 360 

accatttgcc tcacagagag gtgttcccca ctcccatccc cattgccaga taaataaata 420 

ttttgagaaa agtgacctaa aacagctgga aatcttaggt gcatctgtct gcagacctcc 480 
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ttaagcaggc tgtatcttac aattccctta ctgcactggg taagtgttaa cttagttttt 540 

gttttgccct ttgctttaaa tattctccaa attaccattt atgcaacatg gttagggtta 600 

atactgcatg gtattcattt acttgtttca tgaactttcc agtactgtac aaggtcaaca 660 

aagtaatgcc tgtggtatcc tcatctctca cttttttact ctgtggtttt agcacagtaa 720 

ggtactgcaa agaccttcct tccaaatgtc tccttgactt tattccttgg gccaattcag 780 

tatcctcaac atcctaagat tttgttgttt tatcactgac ctgtggttgg cctgttttat 840 

tctaatttcc agaaaagttc aatcccagta tttgcaatat caaataactc taaaaccgat 900 

gttgtgattc taccttcctt actattttta ctgggcaaat gccctatttt tttaattatt 960 

attattttta acttttggga cacacaaaaa tcagcaattc tcatgaagcg tttgttagtg 1020 

tggcagactt gtctaattcc tgaaactcat tcatcccctt gagccagcca atggggagga 1080 

ataggataat gcaaacacat gttttgtttt ctcattttca aataatttac catgttaaaa 1140 

taaacttttc tttgtttttt atttgtagag tcagctaagt acccatattt aaatgccgtc 1200 

tttattattt ttttgaggtc tttgtttttg tctgtttttg ttttgttttg ttttgtaaat 1260 

aaggtaactg ggcaatcaaa caccttttgg ggattctggc tttagtattt tatcagccat 1320 

tttaaaatta aatataaaaa tcctttgtaa gaaacttgca tcctaatttt tctttattgc 1380 

aattgaaagt gtaaataata agacaatgta agtaagacct tcctaatgtc taatacaaac 1440 

tgggcttcag caagtggcct atttttatta gggttttgaa aggttgtgtg tgtgtgtgcg 1500 

tgccgtgtgt gtggttttct tttttaaatg gatagtagag tggtggctgg ataagggtac 1560 

ctgtaatggg ggtttggnca gcaagnctga aatttatact tttgnaaata aaactac 1617 



<210> 69 

<211> 1389 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (755) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (1177) 

<223> n equals a,t,g, or c 



<400> 69 

gcttttttag gcattcattg gacacttgct ttaataagaa tagtttctta gttggcataa 60 

tgcttctgaa gtaatggtac tttaaaataa tttatctcat gaactttaag cattcctctt 120 

gaaatcttgt ttttactcta tctagtcagc ccttataggc aatccaaagg gatgctttgg 180 

atgctttagt ccagtggttc tcagagagtg gtctgtggag tcctggaagt cgctgagacc 240 

ctttcaggca atttgcaagc tcaaaactaa tttcagaatg acaccaggat gttctgtgcc 300 

ttttttgctg tgttggctat ttgcactgat gatgcaagag aaatggggag gagtaaaatc 360 

actggtgtct taccactatt caagacagtg gcaccaaact gtagtagtct agtaggcatt 420 

gtatcctgca cagcctcccg ccaggagtgt gatggaagac caaggaagca gtgtaaatgg 480 

aggtacacgg gaagcacttc ctttgcaccg tgaaggatga tgactgtttt aaggaaaagc 540 

acttatgcca tggtttgtgt cgtgagctga accagctgcc tttttcatgg aacatgactt 600 

ttacttgaaa gagtaactga cagaaaacca attattctga cttgtgtttg tagcagacat 660 

tttcttgaaa atgaagaagt gggtctgtga cttcaggaaa atgtctgaca gtatctgttg 720 

ccaaagaaaa tgtgagtttt caagccaaga atagnaatct agaaaacttg tattcmcctc 7 80 

catgggcttg atgmcctctt agtacttaga cctttctgac gagataagcg gtggcattaa 840 

caaatgtgac gtttttcatg ttatctaaat acatttgtca acatttrraa gatctgcaca 900 

actccctgga ccaggattty ccaaatgatt gttgcttttt gttacaaaat cagggaatag 960 

atagaagatt cattcaaata atgaaagata gactgatgga tttttatgta acagaatagg 1020 

aaaagtttat gacatgtttt cagattccac cttgcaacta atttttaaga agctaccact 1080 
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tgtcagccag gcacagtgct cacgcctata atcccaacac tttgggagtc caaggtgggc 1140 

agatcacttg aggtcaggag ttcaagcctg gccaacntgg tgaaagactt ctctactaaa 1200 

actacaaaaa ttagctgggc atggtggtgg gtacctgtaa tcccagctac tctggaggct 1260 

gaggcaggag aattgcctga gcctgggagg cagaggttgc agtgagccaa gatcgcgcca 1320 

ctgcacacca gcctgggcaa caagtgcgaa actctgtctc aaaaaaaaaa aaaaaaaaaa 13 80 

aaactcgag 1389 



<210> 70 
<211> 1896 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (1802) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (1856) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (1886) 

<223> n equals a,t,g, or c 



<400> 70 

aaaacaaaaa agctaataat ctcctcaagc aatttctggc ctaatagaat tatagtagac 60 

agtgaagtat ctaaacccag ggaatcagat tgaggcacca tgtccatcgc cttgagaatt 120 

aataggctgc atttctgggt tctccttttt tttttttttt ttgcccaact gagtctttct 180 

gtggacttac atggaacttc ttattctctt aaatcattaa gttacttgac aatattcttg 240 

gatttggaga aactggatgt agggccgtat gaaaaaatca ttcgaaatca gatttagggg 3 00 

tataaggttg gataggaatg ttttagaaag aagaatgtaa ggcagataac taatttgtca 360 

catccaaagt ataaaactgc tactttttcc ctagaaaagg gaagctcatt ttaggcagcc 420 

taaaccagta agattttctt cctcctccaa gtgcagattt ttgtaccttt cgtttgtcaa 480 

aacattcttt ggccctatgc atgccagagt gatatagaaa ggaagttacc acattttttt 540 

gagaacaaat cactcctgat aaaatttctt agacaattga taatcatttt aagaagaaat 600 

ttaattgtat ttagctctgt gtctcgcccc tttggtgtca ctcttctacc tcttccatca 660 

ctatagctaa atatttagaa gtatatcttg acacctagca caaatgtttt ggttaagtat 720 

cttaaaactg atggatggta tggctggggc agcatggctc acgcctgtaa tcccagcact 780 

ttgggaggcc aaggcgggtg aatcacctga ggtcaggagt ttgagaccgg cctgaccaac 840 

ttggagaaac cccgtctyta ctaaaaatac aaaaattagt csggtggtgg cgcatgcctg 900 

taatcctgtc tactcaggar gctgargcag gagaattgcc tgaacccggg argcarargt 960 

tgcartgagc tgaratcgtg ccattgcact ccagcctggg caacaagagc aaaactcagt 1020 

ctcaaaaaac aaaacaaaaa acctgttggt atagtacgaa agaaacgtct tgcagttttc 1080 

tgttgcagag aattaattag aaccaacctg ttggattata cacattcacc tttcagaatc 1140 

ctttcttctc tgtggaaacc cacactctca gcagtgtgtg ggaacacagt agattcttaa 1200 

ggaatgcttg ttgaatgttg cagtctgcat cttcttgaag taacagaact gttggtagct 1260 

gtttaaaagt aaaatgtgtc taaagacctt ttggaaatta agatgtaaga gattaatgca 1320 

ccaaagcagt ctcttaatta cttaaaatga attatttcaa agaatcttta attgaatttt 1380 

ctgtgaagtc tggaatttgt aaattatgtc cctttgttca aaccagcccc tgaaaagaac 1440 

aattaaggca attaagatag cattaaagtt ttcaatgaag ttggcatttt cygtgtatta 1500 

agattagatg ttagctgctg aagtttgtgg aggtcggaca taaagcttcc aacatcagta 1560 
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atgcaaaatt 
aaagggccat 
tttgatattc 
attttgaaaa 
angaggatta 
tccctttttt 



gtcttgaacc 
gtagcatgcc 
atycaagttc 
gttttccaaa 
aaccttccag 
ttcccaatgg 



tgcgataaaa 
tcaaagccag 
aattttcyca 
gaaagtaaaa 
gttccaaatg 
gtaatnaaat 



ttttgttgga 
gttactcagc 
cctgatttwa 
aatttaaata 
gttttgggtg 
aattaa 



cttttttttc attgcagtgr 
ctagtccttg tttaagcagt 
kgattaattt cctkggaaaa 
atccggtaac cccgtataat 
gcaattttcc cttccnaagg 



1620 
1680 
1740 
1800 
1860 
1896 



<210> 71 
<211> 308 
<212> DNA 

<213> Homo sapiens 



<400> 71 

ggcacgaggc 

cgggcgccgt 

agctggagct 

ctgatgaggc 

ctggacagcg 

aaaaaaaa 



ggcgctgcga 
gctggccgcc 
ggacgtggag 
gggtcggccg 
cccgaggact 



ggacccatgc 
agcctgctct 
ctggtgcccg 
ccacccgagt 
gggacattaa 



agctgacgct 
gggcgtgcgc 
aggacgacgg 
gagcgacacg 
acctgacctc 



ggggggcgcg 

cgtgggcctc 
gacggcctcc 
gccgtggggc 
ccctcctcca 



gccgtgggcg 
tacatggggc 
gcggaaggcc 
ctggcaggcg 
aaaaaaaaaa 



60 
120 
180 
240 
300 
308 



<210> 72 
<211> 1688 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (912) 

<223> n equals a,t,g, or c 



<400> 72 

acccacgcgt 

tgaacatatc 

aatgttgcag 

aagaatgctg 

aaacaaatat 

ccttttttct 

agtcttttaa 

atgatgagag 

tagtggagta 

caactggcta 

aatagatttt 

acttttgtga 

aacaggatac 

aactttccct 

agacagaaag 

cccagctaag 

ttctgggaac 

agggtgccac 

agctgcctgt 

gatacaagag 

attttgagac 

ttgtgcagaa 



ccgctcatgt 
gctttccctt 
cctggttgag 
ccttgtctgt 
tgtgtgctct 
ttaccccttt 
attctgtctg 
acagcacaat 
atttaagaac 
atgaattttt 
aaatggctaa 
atctgacagg 
taacgccatg 
cagctattat 
agcttctggg 
angtcacttg 
ttggagagtg 
tttctcaaga 
gttctccctg 
ggatcattgg 
cctaatcact 
gtatgtatgt 



ggacttatgc 
tttcctctcc 
aaggagagaa 
ggacaaagat 
taacgattaa 
actctgcaag 
cctactagtt 
aaatgtacct 
tctcttgcct 
aaaaagagaa 
actactagcc 
ccacattttt 
gagttgagct 
gcaacagatc 
aaacaagctt 
gtttgggctt 
gatgatattc 
tgaaaactgt 
tgtgggtgag 
ctccaatttt 
ttaccttcct 
atttagttca 



cagtctagag 
ctccgcccct 
aaaggtggca 
ggaccatgtg 
gctgtgttat 
aatggggaaa 
ttaagtatat 
tatctcctta 
tcaccaaccc 
gaaaaatact 
ttaaaactac 
atatggccct 
gggcctagcg 
agggaaaaag 
acatagtctt 
cattaggact 
aggctctgaa 
gactgaaaaa 
agaagggact 
agagaacttg 
ccaaattacc 
ggttgacttg 



gcagaatcag 
cccagtacag 
ggaatttcca 
cccttcggaa 
ggtgggtttt 
gaatgcatac 
ggtatgttgt 
ggctgaaggc 
aaaaggttgc 
agttttcccc 
tagtctataa 
ttacagaatg 
atggagggac 
atgggatgac 
ttttaaaatg 
ggagactttg 
acattcccag 
attaataata 
agactcctaa 
aaagcaaggc 
caacatacgg 
tgtccttata 



aaggcttggt 
tccatctttc 
ggagatcccc 
ttagggatag 
caggttttta 
tgcgaaaatg 
aaaatttcca 
cataactaca 
tttttgatag 
tcttttggga 
atcaactacc 
gagtgtgttg 
actctaacac 
agatggggtc 
cacaaagcct 
ttggagttct 
cgctctcccg 
aatgtttctg 
gcctgcctca 
tttggacaaa 
taaacaacat 
aactgttact 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
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caaatgattt gaacttttat gcgactggga tttttttttt ccaaagctac aagcatggcc 1380 

gcctgtggta tcgaggtgtt gcaaacaata tctgtgttgc gcttcctgtt ttaacctacc 1440 

tcgttttgtt tgtttttgtt tcactgttca tcacagcagt gttatctcca ggagacatat 1500 

agagagctca accggcaatc tcaggtgcat ttaacatttt taaaacgaaa cagtagttga 1560 

ccaaattttt cttcttaaaa aattggaagt ggggggaatc caatgacaaa aactaatgtg 1620 

gcttgtttct ggagaaaata attactgtaa atggaacaac aacaacaata aaacacacgt 1680 

taaacatc 1688 



<210> 73 

<211> 1138 

<212> DNA 

<213> Homo sapiens 



<400> 73 

gggcgcctgt agtcccagct attcaggagg ccgaggcagg agaattgcct gaactcagga 60 

ggcggasttg cagtgagccg agatcgcgcc attgcactcc agcctgggtg acagagtgag 12 0 

actctttctc ccaaaaaaaa aaaaaaaaaa aaagtcaaat gcagctggga atgtggttcg 180 

tgcctttttg tatattaacc atttgaaact tggttgtaag gtggggttgg caatgtcagg 240 

cctggctgca gcagctcatg tctttagagt gtgcctcttc cctctctcgt ggggctcgag 300 

caagactacc ttcatacatg ggctctccag ttacatagca actccagtgt taaattccat 360 

cttttcttcc tggaaaagcc gtagaaagga cacctggaca tgcctgctgc acaggttgtc 420 

tgccttcccc atcagccsca gaaggaggaa ctttgctctc ttctctcaca gctgtgtgtg 480 

cataagaagt agttcggatg atgtgggtcc caccatgtat tccttctctg ttccatgtag 540 

agtaaaataa atgggagttc tgtttaatgc atcacctcgg ttcatattgc atttgccaag 600 

aaagtgcaat tttattgaac attaggattg aattcttaac tgagtaatca atttcagtag 660 

taagttaaaa tgccttctat taatggacaa ctgcaaccgt taatcagagt tacagtagat 720 

taacagttgt cagcatttat gctaatagca ctaataaacc gtgggctcat gatttgcact 780 

ttataattcc atatttctca aaacagttgg taatactttt tgcttgaagg tattgattct 840 

tttgtccctt tgcttgctac ttggagatgt agagaaagct aaatgacatt ttcacggtga 900 

tgacacaata tcaccttctg cttttgcaca cttggctttg tgtcaaaata gatggaaagg 960 

gttcatttgt tctggtgctc tactgtttaa tttgatctgg tgtgtgacta aagcaagaca 102 0 

aatagtattt ttaatgaaac catttaataa cctctggtag cttagagtcg aaggcattgg 1080 

aaaaatgcaa ttaaaggatg cctagatgta aacaaaaaaa aaaaaaaagg gcggccgc 113 8 



<210> 74 

<211> 777 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (761) 

<223> n equals a,t,g, or c 



<400> 74 

gtagcacctt gaaattgggc agttctgaat atgctgttga gtaaagggaa aatcactatc 60 

tttttaggac ctttggaatg tggctccatg catttgctaa cgttgttctc ttcagggctg 120 

atttttctgg gctgttctac tcctctatcc ttctgtgatt gtcttccaat tcttttatta 180 

tggttagagt tccctgtaga aaccagtggg gtgtgtagtt aacaagtgtc aaaaggagta 240 

gaataattac tttatgtgat gtacttacga gaatactact tagtagaatc cagtataacc 300 

aaacaaaatg tagacgtatt taactatcca gtgtgtcagc tacacatttt ctcatcttta 360 

tcacttctgc tctggatatt gtgatctact ttcctatctc tttgtatttg tkttttcaca 420 

tttctttttt gtgtaaactg ctctatactc ttattgaaac ttgagcataa ttttatattt 480 
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acatagtaaa gtttctgata tgattagaac ataagttgtw tctcctaatt ttccaataga 540 

gccattgatk tttcatttta acccctttta atggaacact tactagcttt ctaaattata 600 

aacttactaa ctttaaatat acatgaatta tttctcagca ttcttaaaac aagggtactg 660 

aacttcgwtt tcttgatgat tcaggggaaa agaattctga gtgttggaaa ggactttaga 72 0 

tgtcttccaa ggatttatgg gatcaattta agaaaaaaaa naaaaaaggg cggccgc 777 



<210> 75 
<211> 1060 
<212> DNA 

<213> Homo sapiens 



<400> 75 

gatgtatttc cttaatatgt agtttcagaa gtggaattta ttagagttaa actaaactca 60 

ttaaatttag agtttcttat tgtctttcat gagaacattt ttccttttca ttcataaatg 120 

atattgaaac actatattct tactttcata ttcctgttta tatttttgtt tttcatgtta 180 

aacattttac attctaatag taacctcatc gacctgttaa aaggcaatat aagatttaga 240 

ttattaaata gcatgtaata tatgtgatca gtaatcttca atgagcttgk tcttcattta 300 

attgcaacgt tatgtctgat tttttttgkt gcaaagcttt cagaatcttg acttgtggta 360 

atcttctttt aaaaaagctt ttaacagaat taatargtca tcacgttatg ataaatgatt 420 

aaggaaatga tgcctctaat acatkgaatt attaaaacta tcattttgaa aaattatatt 480 

ggtacaaact agtgtctact gctattactc atacatttca gaattcatac atggatatcg 540 

tctaggattt tttttttgcg taatcatgag ttacggtgtt aaagttatag tgttaattta 600 

attatgttat agtgttaatt tatctgtttt acatctcact tttgtatctg aaaccgttcg 660 

aaaataatta ttattaaagg ccagttgaca aaatttccac tcctcctccc cagtgtgact 720 

ttccttattt gtattatacc tataaagact acctcttaca tcggccaggc acagtggctc 780 

acgtctgtca tcccagcact ttgggaggac gaggtgggca gattgcctga gctcaggagt 840 

tggagaccag cctgggtaat atagtgagat cctgtctcta caaaatatac aaaattagct 900 

aggcgtgcct gtagtcccag ctacttggga ggctgaggtg gtaggatggc atagagtcca 960 

ggaggcagag gttgcagtga gctgagatgg tgccactgca ctccagcctc agtgtcagag 1020 

ccagtccctg tctcaaaaaa aaaaaaaaaa gggcggccgc 1060 



<210> 76 
<211> 1503 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (6) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (18) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (41) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
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<222> (1501) 

<223> n equals a,t,g ( or c 



<400> 76 

gtggangccg ctcctganaa ctagtgggtc ccccgggctg ncaggattcg gcacgagaat * 60 

gaatggcaaa gaaatagaag gggaagaaat tgaaatagtc ttagccaagc caccagacaa 120 

gaaaaggaaa gagcgccaag ctgctagaca ggcctccaga agcactgcgt atgaagatta 180 

ttactaccac cctcctcctc gcatgccacc tccaattaga ggtcggggtc gtggtggggg 24 0 

gagaggtgga tatggctacc ctccagatta ctacggctat gaagattact atgatgatta 300 

ctatggttat gattatcacg actatcgtgg aggctatgaa gatccctact acggctatga 360 

tgatggctat gcagtaagag gaagaggagg aggaagggga gggcgaggtg ctccaccacc 420 

accaaggggg aggggagcac cacctccaag aggtagagct ggctattcac agaggggggc 480 

acctttggga ccaccaagag gctctagggg tggcagaggg ggtcctgctc aacagcagag 540 

aggccgtggt tcccgtggat ctcggggcaa tcgtgggggc aatgtaggag gcaagagaaa 600 

ggcagatggg tacaaccagc ctgattccaa gcgtcgtcag ccaacaacca acagaactgg 660 

ggttcccaac ccatcgctca gcagccgctt cagcaaggtg gtgactattc tggtaactat 720 

ggttacaata atgacaacca ggaattttat caggatactt atgggcaaca gtggaagtag 780 

acaagtaagg gcttgaaaat gatactggca agatacgatt ggctctagat ctacattctt 840 

caaaaaaaaa aattggctta actgtttcat ctttaagtag cattttgctg ccatttgtat 900 

tgggctgaag aaatcactat tgtgtatata ctcaagtctt tttatttttc ctcttttcat 960 

aaatgctctt ggacattatt gggcttgcag agttccctta ttctggggat tacaatgctt 1020 

ttatcgtttc aggcttcatt ttagcttcaa aacaagctgg gcacactgtt aaatcatgat 1080 

tttgcagaac ctttggtttt ggacagtttc atttttttgg atttgggata gattacatag 1140 

gagtatggag tatgctgtaa ataaaaatac aagctagtgc tttgtcttag tagttttaag 1200 

aaattaaagc aaacaaattt aagttttctt gtattgaaaa taacctatga ttgtatgttt 1260 

tgcattccta gaagtaggtt aactgtgttt ttaaattgtt ataacttcac acctttttga 1320 

aatctgccct acaaaatttg tttggcttaa acgtcaaaag ccgtgacaat ttgttctttg 1380 

atgtgattgt atttccaatt tcttgttcat gtaagatttc aataaaacta aaaaatctat 1440 

tcaaaacaww aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 1500 

naa 1503 



<210> 77 

<211> 872 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (844) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (858) 

<223> n equals a,t,g, or c 



<400> 77 

ggggaagttc ttcactgcct tgcatttgac tccagatccc tccatcctcc cagagccttg 60 

gcctcaaaaa tgctgattct agcatcatgg aaatgctgtc ctcaaagtgg tctaaacggg 120 

ttgctgcttc acttgctcac ttaatctccc ttttcatagg gctgttgttt ttacttctgg 180 

gaagttctgt ttaccctgga acagaaactc tcttccctaa aagttgattt tattgaccca 240 

tggaggccag agacacttag gcatattttc cctccagact agaagcttct gaggaggacc 3 00 

tcctgagtct gcaccctggc tccctgctgt gctgagggcc cccgtgttaa cctcacgttg 360 

tgcctcctct gattcagagg gcccagtgtg gttctgtcag ccaggcagtg gccccagctc 420 
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tacagaaatg agttgtcatt gcatcctagg gccagggtct tcgtgcttgt gtgtgttacg 480 

tggaagtatg tggacaccaa gtgttcctgg atggccacag cctgcgaagg aaactggggc 540 

cagcagctgc tctgtgtttt cagccaacaa tggctcctgc ccactgccgc tgcataacca 600 

ccagaggcag gcttctcttg acacaggcct gtcgttggag catgtgcctg gcgagtccta 660 

tttctattcc cctgtgggtt agggacaggc agctgtacct tcagtgtgtt gctggggcag 720 

gagaatcgct tgaaccggga ggcggaggtt gcagtgagcc aaaattgcac cactgcactg 780 

cagtctgcag gacagagaga ggctmtatct caaaaaaaaa aaaaaaaaaa actcgagggg 840 

gggnccggga cccaattngc catataggaa aa 872 



<210> 78 

<211> 573 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (560) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (563) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (566) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (567) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (571) 

<223> n equals a,t,g, or c 



<400> 78 

gatcaagttc cttagttttg acaatcaggt cccaaactct ttttcttgcc tcatttattc 60 

attcaacaag tattttttgt gcactgaata tgtggccctc actaggcaga tgctgcctat 120 

tcttttgcct gttaactaat ttaacctctt gtcatacctc ccaaatcacc ttatgctcca 180 

gagaaacttg tgtatggtca cgtaccacat aatgatgctt tggtcaacta cagatagttg 240 

atccagtaag attaccatgg agctgaaaaa ttccaatggc ctagtattta ctgtgctttt 300 

aattattatt ttggaatgta ctccttttac ttataagaaa cttagctgta aaacagcctc 360 

agtcattttc ttcatgaggt atttcagaaa atggcattgc tatcatagga gatgacagat 420 

tcatgcattt tattgcccct gaagactttc cagtgggaca agatgtggag gtggaagaca 480 

gtgatattga tgatcctgac cctgcataaa tctagcttaa tatgtgtgtt tgtcttagtt 540 

gctgacaaaa aaaaaaaaan aanaannaaa naa 573 



<210> 79 
<211> 1509 
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<212> DNA 

<213> Homo sapiens 



<400> 79 

ggcacgagga tgtacctaat gagcttctcc attcactttg taaaaataat ttgtatgtgt 60 

accatcttgg tcctctcccc tcccgttttg ttaaaatatc aggatagcac tcccaggcca 120 

ctttggtctc agtgtaagat ccctattaac tatctgaaag gaaaatagag ccaagacctc 180 

tggtctcaaa tatataggaa ttgcctttct ttagtcttca ggactattgt gtgaaaacaa 240 

gtaggggtct aatctcctag aaggtagggg ctttatcctt aaagagaa.ta tgtccccaga 300 

ttattagcac ttttagagga gaagccaagg tatgtagggg tgtgtggctg gcccatcagt 360 

ggagcacgaa gagagaatgg gataccattg tgggaagaga agaaaagttc ctcaggggcc 42 0 

tcccactgct aaagtttttt gtgagatgtt gatctgtgct tcctggattt gacttttaaa 480 

ggaattattc tggcagcaca tgtagtattc ttggatgatc ttgctgctct tatttctcct 540 

tttgtgtgtg tgtgtgtgtg tgtgtggcta tgggttttca tttgtaactc catctgctta 600 

ggagagtggg ctctctataa gggaacctgc tgtaaacttc attgcagcaa ggatgtagag 660 

agaaatagga cttaattcca ctaggggctc tcatctcaca ccttaaggag gagatttcta 720 

gaaaaactgg gccagatttt ctttgttctc catcatttta atgtggcagg ctgttcagtt 780 

ttcttactct tacctatgtg atatttcttc gtaacgtgtc caaaaagaaa aaagacccaa 840 

tcagtgtctc ttgactttgt tctttgatcc ctcagtttct tcttgatttc agcatgtgtc 900 

gggttcctaa ttttgggtat gagttagcaa atttaaccat tgtgtttgtg ccctacccag 960 

gggactcccc agtttctgac ttgaagtaga ctgagaagaa tccacgaggt gctatctggc 102 0 

cagatttaag tagattctat ttccttggtt ctccctctcc ctgaggacct cttattttat 1080 

tgtcccctct tctaggttaa ttctcctttg atttgacttt gttgagaagg aggttggaca 1140 

gtagattagc aaagttccaa gtgcaaaatt acagtgtgtt agagtgtggg gggaaaatta 1200 

gtcttatttt tccctacatg ggatacaaca ctgtgaattc aatcttcaac tgaaggccct 1260 

gcagttctcc taaaacatag ttgtttgttt ttctttaaca aagtttaagc tagtgttaat 1320 

aaattaaaaa aaattgcttg tctgtctact tcagctttgt tttatgccca tttcatattg 1380 

ttgtctgtgt tgtaattcat aacttttgat accatttctg atgtgtaaaa ttggttgtct 1440 

tgtaaatatc ttataaagag ttcaattgta aataaactat tgtggctgtt aaaaaaaaaa 1500 

aaaaaaaaa t caq 



<210> 80 
<211> 1109 
<212> DNA 

<213> Homo sapiens 



<400> 80 

ccacgcgtcc ggccgcagaa cgggctccgc ggacgacggg ctccagggac gcacaggcag 60 

cgggcctccc accgcgggtg ccgggggcgg gggggctgcc cccatgcggg gcccttcctg 120 

gtcgcggcct cggccgctgc tgctgctgtt gctgctgctg tcgccttggc ctgtctgggc 180 

ccaagtgtcg gccagggcct cgccctcggg gtccctgggc gccccggact gccccgaggt 240 

gtgcacgtgc gtgccgggag gcctgccagc tgtcggcact ctcgctgccc gccgtgcccc 3 00 

cgggcctgag cctgcgcctg cgcgcgctgc tgctggacca caaccgcgtc cgtgcgctgc 3 60 

cgccaggtgc cttcgcggga gcgggcgcgc tacagcgcct ggacctgcgc gagaacgggc 420 

tgcactcggt gcatgtgcga gccttctggg gcctgggcgc gctgcagctg ctggacctga 480 

gcgccaacca gctggaagca ctggcaccag ggactttcgc gccgctgcgc gcgctgcgca 540 

acctctcatt ggccggcaac cggctggcgc gcctggagcc cgcggcgcta ggcgcgctcc 600 

cgctgctgcg ctcactcagc ctgcaggaca acgagctggc ggcactcgcg ccggggctgc 660 

tgggccgcct gcccgctcta gacgcgctgc acctgcgcgg caacccttgg ggctgcgggt 72 0 

gcgcgctgcg cccgctctgc gcctggctgc gccggcaccc gctgcccgcg tcagaggccg 780 

agacggtgct ctgcgtgtgg ccgggacgcc tgacgctcag ccccctgact gccttttccg 840 

acgccgcctt tagccattgc gcgcagccgc tcgccctgcg ggacctggcc cgtggtttac 900 

acgctcgggc cggcctcctt cctcgtcagc ctggcttcct gcctggcgct gggctctggg 960 

ctcaccgcct gccgtgcgcg ccgccgccgc ctccgcaccg ccgccctccg cccgccgaga 1020 
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ccgtccagac ccgaaccccg atcccgaccc ccacggctgt gcctcgcccg cggacccggg 1080 
gagccccgtc cgctgccgcc caagcctga 1109 



<210> 81 
<211> 807 
<212> DNA 

<213> Homo sapiens 



<400> 81 

cccacgcgtc cggacgtcct gatagatcct ctgctccaat aggcaactcc ggccttcccc 60 

gccctgacct ggaacctctg ggagggctgc agagtaagtg ccgcctctgc gctccgacgg 120 

aggcacgagg cctgtggagt aggtccctct gttccgacag gtgcgacact tggcgctcca 180 

tgcttgcggg tgccgggagg cctggcctcc cccagggccg ccacctctgc tggttgctct 240 

gtgctttcac cttaaagctc tgccaagcag aggctcccgt gcaggaagag aagctgtcag 300 

caagcacctc aaatttgcca tgctggctgg tggaagagtt tgtggtagca gaagagtgct 360 

ctccatgctc taatttccgg gctaaaacta cccctgagtg tggtcccaca ggatatgtag 420 

agaaaatcac atgcagctca tctaagagaa atgagttcaa aagctgccgg ttcagctttg 480 

aatggaacaa cgcttatttt ggaagttcga aaggggctgt cgtgtgtgtg gccctgatct 540 

tcgcttgtct tgtcatcatt cgtcagcgac aattggacag aaaggctctg gaaaaggtcc 600 

ggaagcaaat cgagtccata tagctacatt ccacccttgt atcctgggtc ttagagaccc 660 

tatctcagac agtgaaagtg aaatggactg atttgcactc ttggttcttt ggagccttgt 720 

ggtggaatcc ccttttcccc atcttcttct ttcagatcat taatgagcag aataaaaaga 780 

gtaaaatggt aaaaaaaaaa aaaaaaa 807 



<210> 82 
<211> 1043 
<212> DNA 

<213> Homo sapiens 



<400> 82 

ggcacgagtt gggccgggca cccccagaag ctgaccttga gacaaggatt tgggtgcaag 6 0 

tggtttattt ggcaggtgcc cagaaagtgc tgacaggagt gggaaagtga gttaggggag 120 

agaaggaagc cactacaggc tatgttcatg tgcaggttac tgctgtgggc aactggggct 180 

tacggatttc taggagatga cgtggaatac acctcagtgt tgccccacca gaagggcaag 240 

gaagcatggg tatttatatg tcagctccca t teat tat tg gctgagggca gctcctagag 3 00 

ggcattgggt ctgcgtttca agcctgctgc acataggctg agaggaatcc ctgagttcga 360 

gtcacaggcg cccacagtca tgctcagaca gcacatacag gaacagtgac tgcagggggc 420 

ataggtggga cacaaatacc accagttata aagaggaaag atgggaagga aagacaagag 480 

gaaggtgtgg agttagattc ctgggtcaga tgtgaacccc tggctctcaa aacactcctt 54 0 

ctttttttct ttttcttttt ttttgagaca ggatctcact ctgttgcaca ggctagagtt 600 

cagtggtgta atcagggctc gtggcagcct ctacctccta ggctcacatg atcctcccac 660 

ctcagcctcc tgagtagctg ggactagagg cacacatcac cacacttggc tagtatttaa 720 

atttttctgt agaagtccag gcgcagtggc tcatgcctgt aatcccagca ctttgggagg 780 

ccgaggcagg tggatcacct gaggtcagga gttcaagacc agcctggcca acatggtgaa 840 

accccgcctc tactaaaaat acaaaaaaat tagcctggtg tcgtggcagg ctcctgtaat 900 

cctggctcct tgggaggctg aggtaggaga atcacttgta cccagaatgt ggagcttgca 960 

gtgagctgag atcatgccat tacactccag cctgggcaag aagagtgaaa ctccatcgca 1020 

aaacaaaaaa aaaaaaaaaa aaa 1043 



<210> 83 
<211> 1173 
<212> DNA 
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<213> Homo sapiens 
<220> 

<221> SITE 
<222> (548) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (603) 

<223> n equals a,t,g, or c 



<400> 83 

gctgtctcag aaaaaagaaa aaagtttcta aagtaaaaat tgaaagtact tcccctacaa 60 

ccacaggttg ctttgacaga ttaatgtaaa ttcttccaga tactcttctg tggatgtaga 120 

aacatgcaga atgaggcaag ctttaatttg cttatgtcac ttactgtgga tagcctttca 180 

tatcttataa gttaatgtca gagcagcaat ctcatttttt tccaatttgt aaacatttta 240 

tttaacctta tgatggatat tttggtggat ttcagtatta caaaaatgcc tattaatagt 3 00 

atattttcat tatatttctg ttacgaaatt ataatgctac aaacattact atgcctgtgg 360 

cagtatacat ctgcacaagt tttgaaaatg ttatgcattc ataggcaaaa atgggataac 420 

ttttgggcag tggtcatgat taatctgttg atcagaatcc agagattgcc cttctccttg 480 

ccaattgctt taagagtacm ctagtttttg gccgggtgca rtggctcatg cctgtatccc 540 

agcatttngg aggccaagac gggcggatca caaggtcagg agatcgggac catcctggct 600 

aanttggtga ggccccattc tactaaaaat tccaaaaaaa cccacmaaaa ccaaaaaaac 660 

ccrgccttgk tggtgggatt acaggcatgt gcmacaacac ccggctaatt ttttttgtat 720 

ttttagtaga ggtggggtgt caccatgttg gccaggctga tctcaaactc ctggccttaa 780 

gtgatctgcc cacctcggac tcccaaagtg cggaattaca ggcgtgagcc accgcgcccg 840 

gccactggtt tttaaacttt attttgaaat tatttcaggc tgggcgcagt ggttcacgcc 900 

tgtgatccca acactttggg aggccgaggc gggcggatca cgaggtcagg agatcaagac 9 60 

catcctggct aaccccgtct ctactaaaaa tataaaaaat cagccgggca cggtggcagg 1020 

tgcctgtagt cccagctact cagtgggctg aggcaggaga atggtatgaa cccgggaggc 1080 

ggagcttgca gtgagctgag atcacgccac tgcactccag cctgggagac agagtgagac 1140 

tctgtctcaa aaaaaaaaaa aaaaaaactc gag 1173 



<210> 84 

<211> 1561 

<212> DNA 

<213> Homo sapiens 



<400> 84 

ggcacgagtg aggctcatgt ctgacctgca gaactgtata atgataaatt atgttgtttg 60 

aaaccgctac atttgcggta atttgttaca gcagcaatag aaaacgaatc ccctgcccag 120 

aatgacttcc tcctttcctg tcggacgaag gctcaggcct tctgctgaaa gcttgctccc 180 

ctaagtagtc acacccaatg ccgaatactc cccagaagca gctgctattt tctgaggaca 240 

atgagttgct tgtaagcctg agaacaggac gaaaacccac tttgcaagca gccctgcgtg 300 

tgacgggcat gccctcggag ggcaggttgg ttttgctatc tgctttctgt cctgcctttt 360 

tccccccatg ggtcctgtct ggctcttttg ctttctcact ttgtgcagaa agccatctca 420 

actcttctca caggagaata gctgtatgga cgtagcaggg ggagttacca catgcctacc 480 

tccatggttt tcgagagggg cccctgccca aatgtctcag tggccacctt catcagacca 540 

tggagcagtc agagcgggaa gggattctag agttggtcca gtccaaccat ctcatcttac 600 

atgtgaagga ggaaaggaag aaagggagaa aaataagaaa gctgaggtca accctcctac 660 

agggatgggc ctggccaaca ggatcccaag ggatgacata acattgaaat taagaaacca 720 

aggaaagttg agaactaaag aaaacagaac ccagtcagcc aagaggcatc cttgagggcc 780 

aaccaagccg tcaaacctgg atgcccccga cgagtcagaa agtcgggtgc ctcaagagcc 840 
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aagcagccaa gaaatggggt ctggaactgg cactttggtc cgcctctgtg cactcaccca 900 

gaaagggtgg aagggaccct gggaccaagt gccaaggtca cacaacggat gaatagactg 960 

ctggacttca aactgaacat gccattttgc caaagcagtc atcaccttcc gtgaatcata 1020 

aatgtttgtt caaagccaca aatgtatata ctctttgtat gtatacagat tttttctaaa 1080 

ggttaacatc taaacagatc aattaaggtc agccttaatt tgtctgagct ttttggttaa 1140 

agtttcctga gtaattgagc gaattcaagt ttctggcttt ctcctttctc tttctccatt 1200 

taaaacatga tctcatgaaa tttttgtccc aagaaaggca ggattacatt ttcttttaac 1260 

agtttgagtt ggtgtagtgt attcttggtt atcagaatac tcatatagct ttgggatttt 1320 

gaattggtaa atattcatga tgtgtgaaaa atcatgatac atactgtaca atctcagtgc 1380 

cacaaaattg gatgttgtgc ctacacacgc acaggaccta gaagagcatg tcaaactata 1440 

aactgcctgt gattgtgaat gactttgttc tttgcttctt gcgtttttca gtttcctata 1500 

atgcacatct taacttttaa aaaataaagg ttattttaaa agccaaaaaa aaaaaaaaaa 1560 



1561 



<210> 85 
<211> 1433 
<212> DNA 

<213> Homo sapiens 



<400> 85 

cccggagccg tggacgccct acagctgaga aggggaccca aggggtcggc cgcggccaag 60 

gcccctagga ccgccgcccc agctcacgct gccgacggca ttatkagaca ttctgcgtca 120 

ggtccgggct cctggacttc gcctttcccg agccctggag gtggggagaa aaggttcacc 180 

aatttttaaa atccaaatat atctcatggt acagtggaag aactggccag agagtctgga 240 

agtttgggtt ctggtcctgg ctgtgccact gactcactgt gaccttggga tcttgtgctg 300 

tgaagacatt tcccaagtgc ttcatgttag ccagcaaatc tgacccacaa ggcctggaaa 360 

gaggtgattg ttaggttgcg cagaggtggt cttatccagc tcagcttccc ctgggaccca 42 0 

ccgtgggacc tgaggcagaa ctggggtgga cttggcctcc tccatggcac accggctgca *480 

gatacgactg ctgacgtggg atgtgaagga cacgctgctc aggctccgcc accccttagg 540 

ggaggcctat gccaccaagg cccgggccca tgggctggag gtggagccct cagccctgga 600 

acaaggcttc aggcaggcat acagggctca gagccacagc ttccccaact acggcctgag 660 

ccacggccta acctcccgcc agtggtggct ggatgtggtc ctgcagacct tccacctggc 720 

gggtgtccag gatgctcagg ctgtagcccc catcgctgaa cagctttata aagacttcag 780 

ccacccctgc acctggcagg tgttggatgg ggctgaggac accctgaggg agtgccgcac 840 

acggggtctg agactggcag tgatctccaa ctttgaccga cggctagagg gcatcctggr 900 

gggccttggc ctgcgtgaac acttcgactt tgtgctgacc tccgaggctg ctggctggcc 9 60 

caagccggac ccccgcattt tccaggaggc cttgcggctt gctcatatgg aaccagtagt 1020 

ggcagcccat gttggggata attacctctg cgattaccag gggcctcggg ctgtgggcat 1080 

gcacagcttc ctggtggttg gcccacaggc actggacccc gtggtcaggg attctgtacc 1140 

taaagaacac atcctcccct ctctggccca tctcctgcct gcccttgact gcctagaggg 1200 

ctcaactcca gggctttgag gccagtgagg gaagtggctg gccctaggcc atggagaaaa 1260 

ccttaaacaa accctggaga cagggagccc cttctttctc cacagctctg gacctttccc 1320 

cctctcctgc ggcctttgtc acctactgtg ataataaagc agtgagtgct gagctctcac 1380 

ccttccccca ctaaaaaaaa aaaaaaaaaa actcgagggg gggcccggta ccc 1433 



<210> 86 

<211> 1377 

<212> DNA 

<213> Homo sapiens 

<400> 86 

ggcacgaggt ccagtcctga ttccatcttc ttacaagtta gggagctggg tccaggcctg 60 
gatccatgtt attatgaatc aggaagttgg gtccaggcct ggctccatgt tcctgcaggt 120 
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cagggcaggt cttcccccga gtgatggctc ttggactgtg ctcctctggg gccctctcaa 180 

ctctgtgtct gtcatctgtc acctgcctgg ccattatggt tttgatggca gtggatgggc 240 

tccatgggac ttcaggcctg gggtgagact caggaccctg gggtgggcat ggatggggat 3 00 

attggacccc tgaaagaagg gaagctgaga gacttttttc ctttaaagac ttttccatgt 3 60 

tatctccact cagagaattc ttttctgcaa agtcacggga gggaggtgac attgagccct 420 

ccaatgtgac agaaactgtg ctgggaactt tacatgtgtt acctaatttg tttaattatc 480 

ccagcaactc cacaaagtag gcatttttat tgttgaggaa acagaagctt agagactttg 540 

tgagacttgc ccgagacccc aggtcacaca ccagcaagga tgaggtcaag cttttaatcc 600 

aggtctgcct ggctccaagt ccacaccctt tcacaacaat gaactttctt tatgattgca 660 

gatattattt ggggaacttt acatcaaaca ttgactacat aaaacttcaa ccatagacta 72 0 

tattctttgt tttggaaact gtgaagactc aaatttttta taaactcaga acagcttcca 780 

gttttctcta gatatcggaa gatgggctgt gttttttgtc tgttgtccag tgaggctgat 840 

ttgtagtcag acaggtgagt cagtttggtt ggagtaggct attgtggttc tctctcatca 900 

ggaaagaggg gatgcacttg gcccctcaac tccaagttgg tggtgcgatg atttttccat 960 

attctccctt aacaggctgt gagggagtct gggccaggca ctaggccatg agcagggcag 1020 

actggggtaa acccttagcg agcctctctc cagccacgag gaaacctgga gtgtgtgcgt 1080 

gcctgtgtgc tgctggtgtg tgtgtgtgaa tgcacacgtg tgtgcatgca ctgtgagctg 1140 

gtgtgtgcat gtgcactggt gtgtgcgttt gtgtgtgtgt gtgtgtgtgc atgtgtgtgc 12 00 

tgggtgcaca catgcatatg tctctgtgta tacatgtgta tgtgtgccag tgggtgcatg 1260 

tgtttgtaca gtgtgcgtgt gtgtgtgtgt ttgtgcacat gagctgctgc acacatataa 1320 

gccttgtgaa ttaggggaag aagaaaggct ccggcttaca aaaaaaaaaa aaaaaaa 1377 



<210> 87 

<211> 1715 

<212> DNA 

<213> Homo sapiens 



<400> 87 

ggcacgaggg acattggagc tccccacacc actcattgct gcccaccagc tatacaacta 60 

cgtggctgat cacgccagct cttaccacat gaagccattg cgaatggccc ggccaggggg 120 

cccagaacac aacgagtatg ccctggtgtc ggcatggcac agttctggct cctacctgga 180 

ctctgaggga cttcgacacc aggatgactt tgatgtgtct ctgcttgtct gtcactgtgc 240 

tgcacccttt gaggagcaag gagaggctga gcggcacgtt ctgcggctac agttcttcgt 3 00 

ggtgctcacc agccagcgag agctcttccc caggctcact gctgacatgc gccgcttccg 3 60 

gaagccaccc agactgcccc ctgagccaga ggctcctggg agttcagctg gcagccctgg 420 

ggaggcctca gggcttattc tagcgcctgg accggctcct ctgttcccac cactggctgc 4 80 

agaggtgggc atggcacgag cacggctggc tcagctggtg cggctggctg gagggcactg 540 

ccgtcgggac accctttgga agcgcctctt cttgctggag ccaccggggc ctgatcgact 600 

gcggctaggg gggcgcctgg ccctggcaga gctggaggaa ctcctagaag cagtccatgc 660 

caaatccatt ggggacatcg acccccagct ggactgcttc ctatccatga cggtctcctg 720 

gtaccagagc ctgatcaaag ttctcctaag ccgcttcccc agagctgtcg ccatttccaa 780 

agcccagact tgggaactca gtacctggtt gcgctgaatc agaagttcac tgactgctct 840 

gcgctagtgt tctggactcc acttaggaaa gacgtctctg aagtggtttt ccgagaagcc 900 

cttccagtac agccccagga cacgagaagc ccccctgccc aactggtctc cacctaccac 960 

cacctggagt ctgtcatcaa cacagcctgt ttcacccttc tggacccgcc tcctctgaag 1020 

ggagtggact ggaccactga atgtcactgt tccttgaatc atgggcctac cagattgcct 1080 

gccagaggca ggactgacca gcccttctgg gccccagggc aagccagaca ctgagtgaca 1140 

ccaaaggctt tgtaactatg tcttgagggt ctgctgcccc agcctggcag caggaaccgc 12 00 

cctccccaaa cacccacagc cactgaccca tccaggactc cagagagtca ggtcaacccc 12 60 

gaggacccct tgggcccttc tggggtactc ctttcggccc ccctggtaga gtctcgggag 1320 

ttcacacagg gtggcaaaca ccccctagag ctcctctgcc tgaatcctgc cccctagcct 1380 

ttgaccactg tcagccacct gtgtcccttg agccttcggg tcttcacttc ccacttggac 1440 

atcactgctg gacattccca tcgagatgac acctgggttc caatcccagc tctgcctttg 1500 

aagcacttgc ggccaccgtc aagtcccttt gctctcggac cctgggtttc tcatccttta 1560 
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atgaggtggg ttcagaagct ctcccatctt cacagcaacc ctggcactgg cttctcaatg 1620 
ggagggaagt cagcagagaa actgaagtgt tagacactat gtgtcccacc accccattac 1680 
agagacatat gacaatgaaa aaaaaaaaaa aaaaa 1715 



<210> 88 
<211> 417 
<212> DNA 

<213> Homo sapiens 



<400> 88 

ccacgcgtcc gctcctctag aggctccaca tgaagtccca gtgctacagt cctagttatt 6 0 

ttgccttctt ctgcctggtt ttctttcaga tcacctcagc cagttctcag acacttaggg 120 

gacatgttct ctgcaggacc actctgaggg actcttctgc atattgctga cctgagagga 180 

tggcctcaga gctgacttgg gcaatcctcc ccaacaggaa ggggagacat tgcctgccac 240 

tgaggaaaca ggtcatgaag gtggagataa gctgcaaggg gcgaagcaac tttatgtcag 3 00 

tggaaaacgt gtctctttaa agctgctatg tgaacagctt ttacagtcat taaatttacc 360 

taaactaagg ttaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaa 417 



<210> 89 
<211> 1167 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (432) 

<223> n equals a,t,g, or c 



<400> 89 

gggggtgggg caggcgacgg tggggaagat ggcgtaccag agcttgcggc tggagtacct 60 

gcagatccca ccggtcagcc gcgcctacac cactgcctgc gtcctcacca ccgccgccgt 120 

gcagttggaa ttgatcacac cttttcagtt gtacttcaat cctgaattaa tctttaaaca 180 

ctttcaaata tggagattaa tcaccaactt cttatttttt gggccagttg gattcaattt 240 

tttatttaac atgatttttc tatatcgtta ctgtcgaatg ctagaagaag gctctttccg 300 

aggtcggaca gcagactttg tatttatgtt cctttttggt ggattcttaa tgaccctttt 360 

tggtctgttt gtgagcttag ttttcttggg ccaggccttt acaataatgc tcgtctatgt 420 

gtggagccga angaacccct atgtccgcat gaacttcttc ggccttctca acttccaggc 480 

cccctttctg ccctgggtgc tcatgggatt ttccttgttg ttggggaact caatcattgt 540 

ggaccttttg ggtattgcag ttggacacat atattttttc ttggaagatg tatttcccaa 600 

tcaacctggt ggaataagaa ttctgaaaac accatctatt ttgaaagcta tttttgatac 660 

accagatgag gatccaaatt acaatccact acctgaggaa cggccaggag gcttcgcctg 72 0 

gggtgagggc cagcggcttg gaggttaaag cagcagtgcc aataatgaga cccagctggg 780 

aaggactcgg tgatacccac tgggatcttt tatcctttgt tgcaaaagtg tggacacttt 840 

tgacagcttg gcagatttta actccagaag cactttatga aatggtacac tgactaatcc 900 

agaagacatt tccaacagtt tgccagtggt tcctcactac actggtactg aaagtgtaat 960 

ttcttagagc caraaaactg gagaaacaaa tatcctgcca cctctaacaa gtacatgagt 1020 

acttgatttt tatggtataa gcagagcctt ttcttcctct tcttgataga tgaggccatg 1080 

gtgtaaatgg aagtttcaga gaggacaaaa taaaacggaa ttccattttt ctctcactgt 1140 

aaaaaaaaaa aaaaaaaggg cggccgc 1167 



<210> 90 
<211> 1892 
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<212> DMA 

<213> Homo sapiens 



<400> 90 

ccacgcgtcc 

ccttccctca 

gactcaggcc 

actgctgact 

ccggaatccc 

tgctgtctgc 

ctaccgttgc 

tattctctca 

gatgacctcc 

gcctgagagg 

ccaggagcaa 

acaagaacac 

ggagggaaag 

gcagacagac 

tgctccccgg 

cattcgatca 

gagaaaccaa 

tgctattcga 

atggaggagg 

atgtctacct 

gccagcctgc 

ttgcctccca 

acgggctgga 

aaggctgtga 

atggggattc 

cttcaaaagc 

tctgcagaat 

atggagccag 

tgcccacacc 

aggctgcccc 

cagtcccaga 

tgttgatctt 



gcgggaccgg 

ctcctgaagg 

tccactccag 

ccaacctgga 

acactcgtcc 

tccaacctcc 

tccaaccacg 

cctaacactc 

cccatctcac 

ctcagcaaca 

gcgccagagc 

aagcaggaag 

caggaagaag 

tcagagccca 

gtacgagaag 

gcccaggaaa 

aaccctggca 

tcgtggagaa 

agatccttgg 

gtgccctctg 

agcggcaaca 

gagcctgtcc 

tttgtacggt 

agatgtccga 

cctaccaaga 

cagcagtgtc 

gagacttaca 

gagttcagca 

ccagcccaac 

ttctgggtct 

gagggccatg 

caaaaaaaaa 



acggatcttc 

tgctgctcct 

gcagccctct 

aggcagagac 

agctggacca 

cttatgcctc 

tctactatgc 

tcaaggagat 

cccacttcac 

acgtggaaga 

acaagcagga 

aggggcagaa 

gacaggggac 

agtttcactc 

tagagtctac 

tagatgaaat 

gcctcctgca 

tacctgcatc 

tttcgggaag 

tgacttctgc 

atgcgacacc 

atcggcaacc 

gggctccaca 

gtctctgggt 

tttgtgacac 

tgatgagaaa 

gtgcgctgag 

ccttgactct 

ctgcccacgt 

gttactcggc 

gtgggagtgc 

aaaaaaaaaa 



tccggccatg 

gcctctggca 

ctctcctacc 

tacctgccgt 

atatgaaaac 

ctggtttgag 

caagagagtc 

agaagcttca 

agtgacagaa 

gctcctacaa 

gcaaggagtg 

acaggaagag 

taaggaggga 

tgaatctcta 

tcctatgata 

gaatgaaata 

gctgccccac 

ataaccccca 

tcggtctgtg 

tccttgaagc 

tcccacaaga 

aggtagggtc 

tggacttctg 

ggctccagac 

agactatatc 

ccgcaatcgg 

ccctggcaaa 

aggccagttc 

tctctattgt 

ccctactcac 

gccctcctta 

aa 



aggaagccag 

cctgccgcag 

gaatacgaac 

ctccgtgcaa 

cacggcttag 

tctttctgcc 

ctgtgttccc 

gctgaagtct 

cgccagacct 

tcctccttgt 

gagcacaggc 

caagaagagg 

cgggaggctg 

tcttctaacc 

atggagaaca 

tatgatgaga 

acagagcctt 

cagccaaggc 

acagccttgg 

tggagcagtg 

ctccctttgc 

cccagaatca 

gtgtgcccgg 

tgagttcctt: 

cagtacccaa 

aaggtgtccc 

agtgaggacg 

ggatgagctg 

tttgagaccc 

atttccttgg 

aaagatgact 



ccgctggctt 

cccaggattc 

gcttcttcgc 

cccacggctg 

tgcccgatgg 

agttcactca 

agccagtctc 

cacccaccac 

tccagccctg 

ccctgggaag 

aggagccgac 

aacaggaaga 

tgtctcagct 

cttcctcttt 

tccaggagct 

actcctactg 

gctggtgctg 

ctggaagtac 

gcggcgacac 

ccactcagag 

agccccttgc 

ggccgctttt 

cttgccacga 

agcttccagg 

actactgttc 

gcatgagatg 

ttgtgcttcg 

gcgtctattc 

cattgctttc 

gttggagcaa 

ttacataaaa 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1892 



<210> 91 

<211> 523 

<212> DNA 

<213> Homo sapiens 

<400> 91 

cacagcaaag caagttctaa gagccaagct tcagaccaat cccccaccgt gaagtccccc 
gtgcgagtgc cccttgaagg aggttttcta acaggtgagt ggtctgattc tgtctctgtc 
ctgtgggatg gatgggctgg cacttgatgg ctctccttcc ccctcacccy ccacggagaa 
ggctggaagg tgcatttctc agacttcctt gcctgggaaa tgggaagtga tgcagaggat 
cccaacgtct cctcggcagg cttggtggtg gacgtgctgg gccatgttcc aggggccagc 
tgctggctcc gtgggtgctg agaggaaggg ggaaggctgt ctattttttg gccaggatga 
atccagcaga tgtggtaggt cctggccgct tgctgacccg tgggtctacc gggtgctccg 
gagctaatgg tccccagatg ctccaccgtc ctgatgtggc agaggcatgg cattttggcg 
ggccagtttg gtggcatcct gggaaccgtt ttggaggctc gag 



60 
120 
180 
240 
300 
360 
420 
480 
523 
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<210> 92 

<211> 1382 

<212> DNA 

<213> Hoxno sapiens 

<220> 

<221> SITE 
<222> (1382) 

<223> n equals a,t,g, or c 



<400> 92 

gccggctggc agcacgactc gcgtagccgt gcgccgattg cctctcggcc tgggcaatgg 60 

tcccggctgc cggtcgacga ccgccccgcg tcatgcggct cctcggctgg tggcaagtat 120 

tgctgtgggt gctgggactt cccgtccgcg gcgtggaggt tgcagaggaa agtggtcgct 180 

tatggtcaga ggagcagcct gctcaccctc tccaggtggg ggctgtgtac ctgggtgagg 240 

aggagctcct gcatgacccg atgggccagg acagggcagc agaagaggcc aatgcggtgc 3 00 

tggggctgga cacccaaggc gatcacatgg tgatgctgtc tgtgattcct ggggaagctg 3 60 

aggacaaagt gagttcagag cctagcggcg tcacctgtgg tgctggagga gcggaggact 42 0 

caaggtgcaa cgtccgagag agccttttct ctctggatgg cgctggagca cacttccctg 480 

acagagaaga ggagtattac acagagccag aagtggcgga atctgacgca gccccgacag 540 

aggactccaa taacactgaa agtctgaaat ccccaaaggt gaactgtgag gagagaaaca 600 

ttacaggatt agaaaatttc actctgaaaa ttttaaatat gtcacaggac cttatggatt 660 

ttctgaaccc aaacggtagt gactgtactc tagtcctgtt ttacaccccg tggtgccgct 720 

tttctgccag tttggcccct cactttaact ctctgccccg ggcatttcca gctcttcact 780 

ttttggcact ggatgcatct cagcacagca gcctttctac caggtttggc accgtagctg 840 

ttcctaatat tttattattt caaggagcta aaccaatggc cagatttaat catacagatc 900 

gaacactgga aacactgaaa atcttcattt ttaatcagac aggtatagaa gccaagaaga 960 

atgtggtggt aactcaagcc gaccaaatag gccctcttcc cagcactttg ataaaaagtg 102 0 

tggactggtt gcttgtattt tccttattct ttttaattag ttttattatg tatgctacca 1080 

ttcgaactga gagtattcgg tggctaattc caggacaaga gcaggaacat gtggagtagt 1140 

gatggtctga aagaagttgg aaagaggaac ttcaatcctt cgtttcagaa attagtgcta 1200 

cagtttcata cattttctcc agtgacgtgt tgacttgaaa cttcaggcag attaaaagaa 1260 

tcatttgttg aacaactgaa tgtataaaaa aattataaac tggtgtttta actagtattg 1320 

caataagcaa atgcaaaaat attcaataga aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 13 80 



<210> 93 

<211> 1747 

<212> DNA 

< 2 1 3 > Homo s api ens 

<400> 93 

ccacgcgtcc ggctacctgt gcatcgtgct 
cgcgccggcc catgggccca ccaacatcat 
cagtttcacc gtgccttcca ccaagggcat 
caacccgtcc agtcagagag ccctctgcct 
cagcatcatc gtccagttca ggtacatcaa 
gttcggggcc atctactacg tcgtgtttac 
cttccgggag tggagcaacg tgggcctggt 
gaccgtctcc gtggggattg tccttataca 
ggagatgaac aaatctaata tgaaaacaga 
aggaataggc attggaggtg gtttctggcc 
gatcatggtg ttagaattga ctggatagta 
ggctcagcac cagagcagag gcccagcagc 



gctcatgctg ctgctgctca 
ggtctacatc agcatctgct 
cgggctggcg gcccaagaca 
gtgcctggta ctcctggccg 
caaggcgctg gagtgcttcg 
cacgctggtc ctgctggcct 
ggacttcttg gggatggcct 
ggtgttcaaa gagttcaatt 
ctagattgca ataggagctt 
gtgattggat gtgaagtaga 
acaggtggtc tggtggatag 
ctctgcagcc caaacgtccc 



tcttctggat 60 

ccttgctggg 120 

tcttgcataa 180 

tgctcggctg 240 

actcctcggt 300 

cagccatcct 360 

gtggattcac 420 

tcaaccttgg 480 

ggatggttcg 540 

agaggtcctc 600 

cggggagcat 660 

aacggtgcct 720 
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ggaccatctc ttctgatgag acgaatctca ttttcatttc cattaacctg gaagctttca 780 

tgaatatttc ttctttaaaa cattttaaca ttatttaaac agaaaaagat gggctctttc 840 

tgggtaggtg gtacatgata gcagagatat ttttacttag attactttgg gaatgagaga 900 

ttgtgtcttg aactctgcac tgtacaggat gtgtctgtag ttgtgttagt ttgcattaag 960 

catgtataca ttcaagtatg tcatccaaat aagaggcata tcattgaatt gtttttaatc 1020 

ctctgacaag ttgactcttc gacccccacc cccacccaag acattttaat agtaaataga 1080 

gagagagaga agagttaatg aacatgaggt agtgttccac tggcaggatg acttttcaat 1140 

agctcaaatc aatttcagtg cctttatcac ttgaattatt aacttaattt gactcttaat 1200 

gtgtatatgt tcttagatta gaataatgca acttcgagta tgctttaata tttcaatatt 1260 

caagttacaa atgtataagg cagttagaaa taatacagtc acatgtcact taatgatagg 132 0 

gaaacattct gagaaatgca ttgtaaggtg actttattgt gtgaacatca tggagtgcac 1380 

ttatacaaac ctagatggga cacctatgac ccacccaggc cagatggtac agcctgttgc 1440 

tcctgggcca cacacctgta cagcatgtga ctgcactgaa taccgcaggc aattgtaaca 1500 

cagtggtgag tatttgtgtt tacaaacata ggaaaggtac agtaaaacta tggtattaca 1560 

atgttatggg accaccgtca tgtaagtggt atgtctttga cagaaacatg gttacgtggt 1620 

tcatgactgt atattcactg gaagatagtc aagactaaag acacattaga gcaaattgac 1680 

ccctttaaca tgtgattatt gtccaattaa agacagttga tttaagtagc aaaaaaaaaa 1740 

aaaaaaa 1747 



<210> 94 
<211> 600 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (553) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (560) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (589) 

<223> n equals a,t,g, or c 



<400> 94 

gaattcggca 

tgctggaact 

ttactgggtc 

tccccaggtg 

cctctctgcc 

caacagctgg 

tacatatttc 

aattaaaagg 

attcccagag 

attgtggttt 



cgagcggcac 
gagtacttcc 
ctcaccgtst 
ccctggacaa 
gctgttgtag 
gcggcctcat 
agttttawag 
aaaaaaaaag 
aattgtattt 
gtnacaattn 



gagccgagat 
gggtccccgc 
tcttcctcat 
cagtgggcct 
atgcatcttc 
cgttctttgc 
catggagawc 
gaagactctc 
aactaattaa 
aactgggtta 



cgttctgggg 
atttggctgg 
tatctacata 
gtgctttaac 
cgtctcccct 
cttcctggtc 
caggaccata 
actgtaaaaa 
tgttttttat 
ctttatttgg 



ctgctggtat 
gtcatgtttg 
acaatgacct 
ggcagtgcct 
gagaaggaca 
accatctgct 
cagtgattta 
cagctgtagg 
attcttaaat 
caagtgttnt 



ggacgcttat 
tagctgtatt 
acaccaggat 
tcgtcttgta 
gtcacaactt 
acgctggaaa 
ccattttgat 
tataatgtat 
ttgctcacaa 
acjgcttttaa 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 



<210> 95 
<211> 586 
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<212> DNA 

<213> Homo sapiens 



<400> 95 

ggcacgaggt tttttccttt ataacggaag ttttataatt catcttttat gtaagtgtaa 60 

ttctcattaa aaatacccta aagcttaaag tttgcaaggc tgcccagcct aacccacaac 120 

agtttgatgc tgccccctag cgtttgattc ccttcacctt ttgctaaaat aaggtaatgt 180 

ttaaattaca attagattta cttactgctg taaatctggt ctattttagt ttcctctggg 240 

tagttagtgt tgctaataag atggacgtaa gtgtttttga actggtgaat tctgattgct 300 

tttagccccc agttttccaa ataggggtga attttgggta gagatagaac aatcaccaag 3 60 

ttaccttgct ccaaaaaaga aatttacgta tgggattgtt ttcaaagcgg gaagttagct 42 0 

gtgtaaataa caacaatttt atatatttaa tctgggcttc tccttatctt gaatgatata 480 

aaaatctact ttctagatta atttagttcc atataacttt gtattgcttt gactgtactg 540 

ataataaagt ttgaaagtgt taaaaaaaaa aaaaaaaaaa aaaaaa 586 



<210> 96 

<211> 802 

<212> DNA 

<213> Homo sapiens 



<400> 96 

ggcacgagcc ctcctccctg ctcgccccca gattcccctc ccctccctgg tgcttttgtc 60 

tggagggtgt tatgggtttg tgtgtgtatg agcgtgtgtg tgtttttgga tttcagacta 120 

attttctgga gtttctgccc ctgctctgcg tcaccctcac gtcacttcgc cagcagtagc 180 

agaggcggcg gcggcggctc ccggaattgg gttggagcag gagcctcgct ggctgcttcg 240 

ctcgcgctct acgcgctcag tccccggcgg tagcaggagc ctggacccag gcgccgccgg 3 00 

cgggcgtgag gcgccggagc ccgggtgagc agcgcagata gtgccctcgg tcgcctcggc 360 

cctcactgtc tccccctggg gcggcctcgg ctactcccca ggtgggacgt gccgcgccac 420 

ctgcccgcgc caccggcacc cagcggccgt ggcggattct gcagcatcat tcgggggccc 480 

cgtcgcggag ccaaagccgc cggcagtctc cgcattcccc tttaaagggt ccttcgcccg 540 

gcctgtacca tggaatcctg tcttggggac cctttcccta cctcccctcc cttggcctca 600 

ggctcgaaga gagagtgggc acactggtgg ctccagcggc gtcagtgcca tcgcggggca 660 

agttgattcc tgggcactca tccatccaca gtctccgggc tggggtcggg gtggggatga 720 

cgcgagcaga gagggagagt gccccaatta gtggtgttgg gggtcctacg ctcagtctta 7 80 

cgcgtgtctg tttgtcctca gc 802 



<210> 97 

<211> 1226 

<212> DNA 

<213> Homo sapiens 



<400> 97 

ggcacgagca tgctttgctt acaatggagt ctgcagtgag gggagatgct gggatagcca 60 

tttccatggc tctgttatgc aagcacaaat ttcatctcct agatggactt cctggttttc 120 

tcttactgca gtaacactgg ccttcccttc tctaattcct taccccagct gcggcatccc 180 

tgtgttaact caggatgcca agtggccctc agattacact tctccagata gctgaatgag 240 

tctgctttca ctgtgactgg gacctgaatg acctgcagtc agggcccaga gttgggactc 3 00 

tatactaccc tgggctctgg tctgtaggtt tgtagtagcc accggtaata agccaagggc 3 60 

taggctcttg tttgagttta tggccacctg gaattttcag tcatctcatg atacaggcgg 420 

gaggggcaga acagatagat tacgacaggt ttggttttta aattttccaa ccaagtggaa 4 80 

aggcaagttg gtcttataga aagcactact gcacttagta gctatgtgat tttgagcaaa 540 

ccacataatc tctctaggtc cattttccta accacaagat aaagatgtta cattgtcaaa 600 

gcttgccgta gatttggggt gaatgaaaat tattccttgc tttcatcact acctttatag 660 
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ctctcatcac tacctttata gctcatcact gtgccttttt ttctttccta agaaagacat 720 

cacatccctc tcctctcctc ctctgtgctc ctgtccctcc ctccccctag caaggtccag 780 

gcaaagctgg agatgaagct gaagatccag agtttcctag aacgcaactt aaggatggct 840 

aaggaaaggg aagcctgact gctcggtcag gagggtgcag tatctcttgc tgggaacaca 900 

gccagtttcc acaatgccta gactgtgtat gtctatttgc acaagattgg cttttcctat 960 

tttggagtgg tcagacattt tatttttgtt caagattatc tggcgtttta gacaaatttg 1020 

caaaactgtg cttttattga ctttttgaat aaactttggt attctggagc aaatgtattt 1080 

atttattggt atgtgcaatg acaaacttgg tatttttccc atgtttgaca tttatgttat 1140 

gtttgttaga attttagtgt ttgtctaagt acacacatat atcaacaaat taaacttgaa 1200 

tcgtttcaaa aaaaaaaaaa aaaaaa 122 6 



<210> 98 

<211> 1120 

<212> DNA 

<213> Homo sapiens 



<400> 98 

aggggactct caccctctcc cagcaatgtc taaagtcagg catctgaaaa ccagcagtaa 60 

tcctgcctct gaagtttatc aggaaaggag cttaaaagag aaccaaattc agcctgtgtt 12 0 

ggaactctca gtcccagagg ggtgtggttt atagctctcc ggcctgctgt tggacttagg 180 

ctgtgaccca cagaaggacg ccagaaagta ctcaagacat tcacggtgcc ccggtcagca 2 40 

ctcgccatga cgaagacttc tacatgcata taccacttcc ttgttctgag ctggtatact 3 00 

ttcctcaatt attacatctc acaggaagga aaagacgagg tgaaacccaa aatcttggca 3 60 

aatggtgcaa ggtggaaata tatgacgctg cttaatctgc tcttgcagac cattttctac 420 

ggggtcacct gcctggatga tgtgctgaaa agaaccaaag ggggaaaaga cattaagttc 480 

ctaactgcct tcagagacct gcttttcacc actctggctt ttcctgtatc cacgtttgta 540 

tttttggcat tctggatcct ctttctctac aatcgagatc tcatttaccc caaggtccta 600 

gatactgtca tccccgtgtg gctgaatcat gcaatgcaca ctttcatatt ccccatcaca 660 

ttggctgaag tcgtcctcag gcctcactcc tatccatcaa agaagacagg actcaccttg 720 

stggctgctg ccagcattgc ttacatcagc cgcatcctat ggctctactt tgagacgggt 7 80 

acctgggtgt atcctgtgtt tgccaaactc agcctcttgg gtctagcagc tttcttctct 840 

ctcagctacg tcttcatcgc cagcatctac ctacttggag agaagctcaa ccactggaaa 900 

tggggtgaca tgaggcagcc acggaagaag aggaagtaat tgcacaccat tttccaagaa 960 

ccaagaaaga agaaaacaca agagattttt ctcatctttt tttttttttt tctggtggag 1020 

ggaggtggtg gaggaacata gcaaagtagg agggacagag agtgatactt aaatttaata 1080 

agaggttcgt gaaggtaaaa aaaaaaaaaa aaaactcgag 112 0 



<210> 99 

<211> 2596 

<212> DNA 

<213> Homo sapiens 



<400> 99 

ccacgcgtcc gacttggcaa gcgttcacaa ccaaaatggc cagctctttc tggaagatat 60 

tgtaaaacgt gatggatttc cactatgggt tgggctctca agtcatgatg gaagtgaatc 120 

aagttttgaa tggtctgatg gtagtacatt tgactatatc ccatggaaag gccaaacatc 180 

tcctggaaat tgtgttctct tggatccaaa aggaacttgg aaacatgaaa aatgcaactc 240 

tgttaaggat ggtgctattt gttataaacc tacaaaatct aaaaagctgt cccgtcttac 300 

atattcatca agatgtccag cagcaaaaga gaatgggtca cggtggatcc agtacaaggg 3 60 

tcactgttac aagtctgatc aggcattgca cagtttttca gaggccaaaa aattgtgttc 420 

aaaacatgat cactctgcaa ctatcgtttc cataaaagat gaagatgaga ataaatttgt 480 

gagcagactg atgagggaaa ataataacat taccatgaga gtttggcttg gattatctca 540 

acattctgtt gaccagtctt ggagttggtt agatggatca gaagtgacat ttgtcaaatg 600 
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ggaaaataaa agtaagagtg gtgttggaag atgtagcatg ttgatagctt caaatgaaac 660 

ttggaaaaaa gttgaatgtg aacatggttt tggaagagtt gtctgcaaag tgcctctggg 720 

ccctgattac acagcaatag ctatcatagt tgccacacta agtatcttag ttctcatggg 780 

cggactgatt tggttcctct tccaaaggca ccgtttgcac ctggcgggtt tctcatcagt 840 

tcgatatgca caaggagtga atgaagatga gattatgctt ccttctttcc atgactaaat 900 

tcttctaaaa gttttctaat ttgcactaat gtgttatgag aaattagtca cttaaaatgt 960 

cccagtgtca gtatttactc tgctccaaag tagaactctt aaatactttt tcagttgttt 1020 

agatcttagg catgtgctgg tatccacagt taattccctg ctaaatgcca tgtttatcac 1080 

cctaattaat agaatggagg ggactccaaa gctggaactg aagtccaaat tgtttgtaca 1140 

gtaatatgtt taatgttcat tttctctgta tgaatgtgat tggtaactag atatgtatat 1200 

tttaatagaa tttttaacaa aacttcttag aaaattaaaa taggcatatt actaggtgac 1260 

atgtctactt tttaattttt aagagcatcc ggccaaatgc aaaattagta cctcaaagta 1320 

aaaattgaac tgtaaactct atcagcattg tttcaaaata gtcattttta gcactgggga 13 80 

aaaataaaca ataagacatg cttacttttt aatttttatt tttttgagac tgagtctctc 1440 

tctgttgccc aggctggagt acaatggcgt gatctcggct cactgcaaat ctccgcctcc 1500 

caggttcaag cgattctcct gcctcagcct cctgagtagc tgggattaca ggcaactgcc 1560 

accatgcccg gctaattttt gtatttttag tagagatggg gtttcaccat gttggccagg 1620 

ctggtctcga actcgtgacc gcaggtgatc ctcccgcctc ggcctcccaa agtgctggga 1680 

ttacaggcat gagccaccgc gcctggcctc tgcttacttt ttatatagca aaatgattcc 1740 

tcttggcaag atgtttctta tattattcca aagttatttc ataccattat tatgtaaata 1800 

tgaagagttt ttttctgttt ataattgttt ataaaacaat gacttttaaa gatttagtgc 1860 

ttaacatttt cccaagtgtg ggaacattat ttttagattg agtaggtacc ttgtagcagt 1920 

gtgctttgca ttttctgatg tattacatga ctgtttcttt tgtaaagaga atcaactagg 1980 

tatttaagac tgataatttt acaatttata tgcttcacat agcatgtcaa cttttgacta 2040 

agaattttgt ttactttttt aacatgtgtt aaacagagaa agggtccatg aaggaaagtg 2100 

tatgagttgc atttgaaaaa tgagactttt tcagtggaac tctaaacctt gtgatgacta 2160 

ctaacaaatg taaaattatg agtgattaag aaaacattgc tttgtggtta tcactttaag 222 0 

ttttgacacc tagattatag tcttagtaat agcatccact ggaaaaggtg aaaatgtttt 2280 

attcagcatt taacttacat ttgtacttta gagtattttt gtataaaatc catagattta 2340 

ttttacattt agagtattta cactatgata aagttgtaaa taattttcta agacagtttt 2400 

tatatagtct acagttgtcc tgatttctta ttgaatttgt tagactagtt ctcttgtctt 2460 

gtgatctgtg tacaatttta gtcactaaga ctttcctcca agaactaagc caacttgatg 2520 

tgaaaagcac agctgtatat aatggtgatg tcataataaa gttgttttat cttttaagta 2580 

aaaaaaaaaa aaaaaa 259 6 



<210> 100 
<211> 1020 
<212> DNA 

<213> Homo sapiens 



<400> 100 

aaactagggg aaaatgtagc caacatatac aaagatcttc agaaactctc tcgcctcttt 60 

aaagaccagc tggtgtatcc tcttctggct tttacccgac aagcactgaa cctaccagat 12 0 

gtatttgggt tggtcgtcct cccattggaa ctgaaactac ggatcttccg acttctggat 180 

gttcgttccg tcttgtcttt gtctgcggtt tgtcgtgacc tctttactgc ttcaaatgac 240 

ccactcctgt ggaggttttt atatctgcgt gattttcgag acaatactgt cagagttcaa 300 

gacacagatt ggaagactgt acaggaagag gcacatacaa agaaaagaat ccccgaaagg 360 

gcggtttgtg atgctcctgc catcgtcaac tcacaccatt ccattctatc ccaacccctt 420 

gcaccctagg ccatttccta gctcccgcct tcctccagga attatcgggg gtgaatatga 480 

ccaaagacca acacttccct atgttggaga cccaatcagt tcactcattc ctggtcctgg 540 

ggagacgccc agccagtttc ctccactgag accacgcttt gatccagttg gcccacttcc 600 

aggacctaac cccatcttgc cagggcgagg cggccccaat gacagatttc cctttagacc 660 

cagcaggggt cggccaactg atggccggct gtcattcatg tgattgattt gtaatttcat 720 

ttctggagct ccatttgttt ttgtttctaa actacagatg tcaactcctt ggggtgctga 780 
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tctcgagtgt tattttctga ttgtggtgtt gagagttgca ctcccagaaa ccttttaaga 840 

gatacattta tagccctagg ggtggtatga cccaaaggtt cctctgtgac aaggttggcc 900 

ttgggaatag ttggctgcca atctccctgc tcttggttct cctctagatt gaagtttgtt 960 

ttctgatgct gttcttacca gattaaaaaa aagtgtaaat taaaaaaaaa aaaaaaaaaa 1020 



<210> 101 

<211> 1520 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (71) 

<223> n equals a,t,g, or c 
<220> 

<221>- SITE 
<222> (473) 

<223> n equals a,t,g, or c 



<400> 101 

gcttttttct 

cagcgttacg 

ttaatattta 

aaaaatacca 

tgcgtcacat 

ttttcttctt 

gggtctggtt 

agactcttta 

tttcctactg 

gggttttgag 

tctagcaaac 

atttcattaa 

agtatccttt 

tctatagata 

ctcaatgggg 

tccaaaggac 

agaaacaaaa 

ggaacaggat 

cagatattta 

agaatagtct 

ggcagattca 

gatttaatag 

ctttagaaga 

aacttatttt 

ggaatagtat 

aaaaaaaaat 



taagtgcaca 
ngatgcttag 
tcagcagtat 
caagtgtaat 
gaccttctat 
ttaaagtcga 
cactttgtaa 
gttttcctgc 
acaaaccaag 
caggagcgag 
tttttacatg 
agtacctggg 
tggccagtgg 
aatctgacat 
gctgtatcac 
cataatactt 
gtctaaaaag 
gaggaaggag 
tggacattaa 
taaggctcct 
ctggccttag 
gagaaactac 
gtaaatgaaa 
ccctggagtc 
taagattaca 
gaccctcgag 



aagcatcata 
cattttgaat 
cataatttcc 
tactctagca 
tgttcatggg 
ctgtagcatc 
aagtaaacca 
agattgttca 
tacttgttac 
attaccaccc 
ttgcacattt 
tgtagtactc 
tcctgttttt 
ttgtcatcca 
ccctagattg 
gagcaaatat 
ggacaataat 
gagatactga 
acagatattt 
gggaattgat 
catttcagta 
tttctgcgtt 
gcatgcttcg 
ttgtgaagtg 
agcacatttt 



ctccctggag 
attgtggcaa 
atcctcttat 
cagctattaa 
tttaaagaga 
ttggcttttg 
tgtctgttta 
agattacatg 
atcaccaatg 
aaaaagggag 
cagttcttaa 
aagtcccccc 
gcccctaccc 
ataccattgc 
gttctgagat 
gaacatttct 
gaaaaaacag 
caggagccct 
atggagcacg 
gataggccat 
attatattta 
tcttttaatt 
atgctgccac 
tgaatttaaa 
atattcatga 



gcaaacacat 
aaaaattaaa 
ttcagaattt 
tgtgctggat 
aagcagggct 
tctggggtgg 
aacaatagag 
ataatcacac 
gtaccaggag 
ctacctgagg 
atgaaggcta 
tcaagagttc 
agactgttcg 
agtcctctgc 
actgcaatgt 
tggggtgagg 
ttgagacctt 
gggtcttgct 
actctgtacc 
ttacccagtt 
tttattttta 
acttgtagtt 
tgtaaatacc 
gcctgctcta 
gccggaaagg 



cgggctgctt 
agttcactta 
cacttgaggc 
gataggccac 
ttgtatttct 
ggaggatctg 
gtgtttaaga 
ggngtattta 
atgaagacgc 
cagcccagct 
ctccagtgtc 
ataagtaagc 
rgaagcatat 
agcatacatt 
cttgtgtcct 
gcagaaagag 
tagtatgatg 
ctgcattaaa 
ctacaggccc 
tcagtttaga 
gcctgaacca 
tacacagtaa 
attcattagt 
tctggaatat 
caaaaaaaaa 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1520 



<210> 102 

<211> 1306 

<212> DNA 

<213> Homo sapiens 
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<220> 

<221> SITE 
<222> (1300) 

<223> n equals a,t,g, or c 



<400> 102 

aattcccggg tcgacccacg cgtccggaat ttaagggacc cacactacct tcccgaagtt 60 

gaaggcaagc ggtgattgtt tgtagacggc gctttgtcat gggacctgtg cggttgggaa 12 0 

tattgctttt cctttttttg gccgtgcacg aggcttgggc tgggatgttg aaggaggagg 180 

acgatgacac agaacgcttg cccagcaaat gcgaagtgtg taagctgctg agcacagagc 240 

tacaggcgga actgagtcgc accggtcgat ctcgagaggt gctggagctg gggcaggtgc 300 

tggatacagg caagaggaag agacacgtgc cttacagcgt ttcagagaca aggctggaag 3 60 

aggccttaga gaatttatgt gagcggatcc tggactatag tgttcacgct gagcgcaagg 42 0 

gctcactgag atatgccaag ggtcagagtc agaccatggc aacactgaaa ggcctagtgc 480 

agaagggggt gaaggtggat ctggggatcc ctctggagct ttgggatgag cccagcgtgg 540 

aggtcacata cctcaagaag cagtgtgaga ccatgttgga rgargaggar gaagaggagg 600 

aagaggaagg gggagacaag atgaccaaga caggaagcca ccccaaactt gaccgagaag 660 

atctttgacc cttgcctttg agcccccagg aggggaaggg atcatggaga gccctctaaa 720 

gcctgcactc tccctgctcc acagctttca gggtgtgttt atgagtgact ccacccaagc 780 

ttgtagctgt tctctcccat ctaacctcag gcaagatcct ggtgaaacag catgacatgg 84 0 

cttctggggt ggagggtggg ggtggaggtc ctgctcctag agatgaactc tatccagccc 900 

cttaattggc aggtgtatgt gctgacagta ctgaaagctt tcctctttaa ctgatcccac 960 

ccccacccaa aagtcagcag tggcactgga gctgtgggct ttggggaagt cacttagctc 102 0 

cttaaggtct gtttttagac ccttccaagg aagaggccag aacggacatt ctctgcgatc 1080 

tatatacatt gcctgtatcc aggaggctac acaccagcaa accgtgaagg agaatgggac 114 0 

actgggtcat ggcctggagt tgctgataat ttaggtggga tagatacttg gtctacttaa 1200 

gctcaatgta acccagagcc caccatatag ttttataggt gctcaatttt ctatatcgct 1260 

attaaacttt tttctttttt tctaaaaaaa aaaaaaaaan actcga 1306 



<210> 103 
<211> 785 
<212> DNA 
<213> Homo sapiens 



<400> 103 

cttttagaag gtacgcctgc aggtaccggt ccggaattcc cgggtcgacc cacgcgtccg 60 

ggaaatgaac taccatttat aacttctgtt tttttattga gaaaatgatt cacgaattcc 120 

aaatcagatt gccaggaaga aataggacgt gacggtactg ggccctgtga ttctcccagc 180 

ccttgcagtc cgctaggtga gaggaaaagc tctttacttc cgcccctggc agggacttct 240 

gggttatggg agaaaccaga gatgggaatg aggaaaatat gaactacagc agaagcccct 3 00 

gggcagctgt gatggagccc ctgacattac tcttcttgca tctgtcctgc cttctttccc 360 

tctgcgaggc agtggggtgg gattcagagt gcttagtctg ctcactggga gaagaagagt 420 

tcctgcgcat gcaagccctg ctgtgtggct gtcgtttaca tttgggaggt gtcctgtatg 480 

tctgtacgtt ggggactgcc tgtatttgga agatttaaaa acctagcatc ctgttctcac 540 

cctctaagct gcattgagaa atgactcgtc tctgtatttg tattaagcct taacactttt 600 

cttaagtgca ttcggtgcca acatttttta gagctgtacc aaaacaaaaa gcctgtactc 660 

acatcacaat gtcattttga taggagcgtt ttgttatttt tacaaggcag aatggggtgt 720 

aacagttgaa ttaaacttag caatcacgtg ctcaaaaaaa aaaaaaaaaa aaaaagggcg 780 

gccgc 785 



<210> 104 
<211> 2015 
<212> DNA 
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<213> Homo sapiens 
<220> 

<221> SITE 
<222> (3) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (9) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (1981) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (1990) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (2001) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (2002) 

<223> n equals a,t,g, or c 
<400> 104 

ggagccagga 60 
ctcttcctcc 120 
gaaattgatt 180 
ttcctggccc 240 
ctgaaaagca 3 00 

cgaaaccaca 3 60 

gtgaagaggg 420 
cattcaaaca 480 
atagaaggaa 540 
aagaaacgga 600 
caccaaaagc 660 
gtccgtggaa 720 
cttcctacac 780 
accctcttag 840 
ctgaaacatg 900 
ggaggggaaa 960 
gcctgctgct 1020 
tgcttgcctt 1080 
tgtggttatt 1140 
ggaaagccat 12 00 

ttgtaatatt 1260 
ttccaggaaa 1320 



ccnggaatnc 
gccaagagca 
tggatgtagg 
tcaagtacgc 
tgaagatctg 
crcctggggg 
atttctccaa 
gttctagccc 
cccaaggctg 
tagaatatac 
ctgcacagag 
ttcaccctgt 
cctaccacag 
actgccatct 
ctgccaggag 
gggactccgg 
tggcagtctg 
ctcagacctt 
gcagagggcc 
attacaatgt 
gacattctac 
aaccctgtat 



cgggtcgacc 
gagcgccagc 
aggagctcaa 
cctcatcggg 
catgatcagg 
cctcagtgac 
aagagatgca 
cgtggaaaac 
gtaatgaact 
agtgcctctg 
gaaggattga 
gttggagctg 
aaaatggctc 
tatcaacagt 
atcccttctt 
gatggtctct 
ggctggcgtg 
caaaggatgg 
atgaatgtca 
acatggctgt 
aattgccatc 
ttctgggatc 



cacgcgtccg 
atgaacttgg 
gtgctggcaa 
actgctgtgg 
aggcacttat 
accatcccgc 
caggtgattg 
agcccatggt 
ttcacatgga 
tcctgaagga 
gcaatttagc 
ttcatgcttc 
atgaaaaggg 
taggcactac 
aaagatggac 
agccctatcg 
gtaggaaggg 
aaccaacgaa 
gttattattt 
tgcatagaag 
aggctaaggc 
acatcacgga 



gcctgcgctg 
gggtcagcat 
caggcaagac 
gtgtcgccat 
ttgacgacga 
taaagaagag 
agctgtaggt 
taacatctca 
ctgaatattg 
aaatatcatg 
ctgcagtgga 
catgaggcca 
gaatccgacc 
tttgtagaac 
tatgtgaaga 
atgatgaaca 
ctttggtgtt 
ggaccaaatg 
ttctccttat 
acatgactgg 
cccgtgagca 
atattctttg 



ccagcagcca 
gctgaggatc 
ccctggggct 
atctgctggc 
ctcttccgac 
agccccaagg 
gagcagtgac 
ggatgtcctg 
gaggcaaata 
cctcttctgg 
agaaggtgga 
tggtgtccat 
caacacacag 
gattagcttc 
ttcgggagtc 
ctggccttct 
catggaatgg 
agaaagcaga 
acaattattt 
tggaggctga 
tttctctccc 
cctttccact 
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tctctcggac 
agtgctgtgt 
gatgctgttc 
aagcaatttt 
aaaaaaaaaa 
acgtcatagc 
tcgtgactgg 
cgccagctgg 
cctgaatggc 
tacgcgcagc 
cccttccttt 
nttaagggtn 



tgggctaccc 
tgtcatratt 
attagcagcc 
ctgtgtgtag 
aaaaagggcg 
tcttctatag 
gaaaaccctg 
cgtaatagcg 
gaatgggacg 
gtgaccgcta 
ctcgccacgt 
ccaattaagg 



tccttgtgtg 
tgcctggact 
tttgttaact 
gataaaataa 
gccgctctag 
tgkcacctaa 
gcgttaccca 
aagaggcccg 
cgccctgtag 
cacttgccag 
tcgccgggtt 
nnttaccggg 



tgatgaaaga 
cccagggcgt 
gataaccaag 
accatcttgt 
aggatccaag 
attcaattca 
acttaatcgc 
caccgatcgc 
cggcgcatta 
cgccctagcg 
tccccgtcaa 
acctt 



tgagctatat 
ctcttaccca 
agcggtaatg 
atgggaaaaa 
cttacgtacg 
ctggccgtcg 
cttgcagcac 
ccttcccaac 
agcgcggcgg 
cccgctcctt 
gctttaaatc 



ttcagaacaa 
acttgataac 
tgatactcat 
aaaaaaaaaa 
cgtgcatgcg 
ttttacaacg 
atcccccttt 
agttgcgcag 
gtgtggtggt 
tcgctttctt 

gggggcttcc 



1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2015 



<210> 105 
<211> 367 
<212> DNA 

<213> Homo sapiens 



<400> 105 

cggcacgagt 

gaataatacc 

atgtcttatc 

gaattactta 

ctgggaagca 

accaacaaat 

aaaaaaa 



gtaaatgtca 
acaacactta 
agctgcctct 
cagaaaccaa 
ggaaaagaaa 
atgtcctaaa 



ccaccaaagg 
tggcctgtct 
ctccagaaat 
cctttgcatt 
aaagtacatc 
aaccaccaag 



tttgcaccct 
tggaggcctt 
gaactgtgat 
aggtgagctt 
actgaaagta 
gaaacctact 



gatcaaaaag 
ctggggatta 
ggtggacaca 
tatcctcctc 
aaagcaactg 
ccaaaaatga 



agtatgaaaa 
ttggtgtgat 
gctatgtgag 
tgataaatct 
ttataggttt 
aaaaaaaaaa 



60 
120 
180 
240 
300 
360 
367 



<210> 106 
<211> 1889 
<212> DNA 

<213> Homo sapiens 



<400> 106 
ctcatccttc 
attggtatca 
atgtgaaaaa 
ccagtcctac 
gcaattattt 
ttttcccaaa 
cattagaaaa 
gacaacatca 
tcccccaaag 
tgcaggccca 
tggatcagca 
ctccttcctg 
gaatttcttt 
gtgtgtcaga 
gctccctgga 
ctgtctgcct 
gggaaagctg 
actaccactt 
ccatcatctc 
cctttgtttg 



tatcatcata 
tatcagtttc 
catcagagag 
aaatggggct 
tggtgatgta 
agctcgattt 
ggtctcgtct 
cgaaggtggt 
cacagatcca 
gtgtgtcaaa 
agggcacacg 
ggcttctctt 
gctccactgt 
tcttcactcg 
gcccgtgtgc 
ttatgaggag 
tgtggtttcc 
cctgtcctca 
tcagcaaggt 
gttggatgta 



tggagtggca 
actgaccggc 
ctcgtactgg 
gaaactgttg 
catttttatg 
gcatctgaat 
acagaggact 
aacaaacaaa 
ttacgcacat 
acagaaactg 
atgggggcac 
gatacggagg 
tgccagtagc 
gattattcga 
tctcgtgtga 
ccagtgtctg 
ttttaccttt 
ccgaaggagg 
gacatatttg 
ggaagcatcc 



ataatgaaaa 
caatctacat 
caggagacaa 
cagaagcctg 
actatatcag 
atggatatca 
ggtctttcaa 
tgctttatca 
ttaaagatac 
aattctaccg 
tttattggca 
aaagtggaaa 
tttgaaatga 
tgacactcag 
ctgaacgttt 
aattgctgag 
cagctgacca 
ccgtggggct 
tttttgacct 
cagggagatt 



tgaggaggcg 
caaggactat 
gagtcgtcct 
ggtctctcaa 
tgattgctgg 
gtcctggccg 
tagcaagttt 
ggctggactt 
catctacctt 
ccgtagtcgc 
gttgaatgac 
atgcttcatt 
aaacatgttc 
tgtgagagtc 
tgtgatgaaa 
gagatgtggg 
tgaactcctg 
ctgcaaggcg 
ggagacctca 
tagtgacaat 



ctgatgatga 
gtgacactct 
tttattacgt 
aaccctaata 
aactggaaag 
tccttcagta 
t c act teat c 
catttcaaac 
actcaggtga 
agcgagatag 
atctggcaag 
actttgctca 
tatatctatg 
catacatgga 
ggaggagagg 
aattgcacac 
agcccgacca 
cagatcactg 
gctgtcgctc 
ggtttcctca 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 



BNSDOCID: <WO_9947540A1 J_> 



WO 99/47540 



PCT/US99/05804 



62 



tgactgagaa gacacgaact atattatttt acccttggga gcccaccagc aagaatgagt 12 60 

tggagcaatc ttttcatgtg acctccttaa cagatattta ctgaaggaat ctaggttgta 1320 

ttttcagtgg acaatgggaa taaagcattt ctaaagcacc gactggagag gaaggcaaca 13 80 

gagacaagga gagaagccga gagacatgtc tgcgtgctgc cacgcatctg agcgattgct 1440 

ctgtgaagag ttgtacactg aacattttca ggggaggctg tttacccagg caatgtcctc 1500 

aaacaagcct gtgccggggt gtcctggaat ctgtgccagg actgtgtttt tagcccttca 1560 

cctctcagct ttagcaggac atgaaccagt tataacaaga tggccctgca gctggttaca 1620 

agaatgtgac atggcaggat ctatggaacc aaatggaagg ttttgaggtg atgtaggtct 1680 

ttcacagtta gctttgggga atacagaata ctcaaataaa gtgctttgtt attatttcag 1740 

agggaatggc gattgaaatg ttacaacaga gatttcttgg tggtagctat ttgggtaaag 1800 
gtatatggat atttttctgt acatgtgaaa ttatataaaa ataaaagtta tataaattac * 18 60 

attgaaaaaa aaaaaaaaaa aaaaaaaaa 1889 



<210> 107 

<211> 1201 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> SITE 
<222> (1086) 

<223> n equals a,t,g, or c 



<220> 

<221> SITE 
<222> (1161) 

<223> n equals a,t,g. or c 



<220> 

<221> SITE 
<222> (1176) 

<223> n equals a,t,g, or c 



<400> 107 

cggcacgagc ggctggcagc acgactcgcg taccgtgcgc cgattgcctc tcggcctggg 60 

caatggtccc ggctgccggt cgacgaccgc cccgcgtcat gcggctcctc ggctggtggc 12 0 

aagtattgct gtgggtgctg ggacttcccg tccgcggcgt ggagggacct tatggatttt 180 

ctgaacccaa acggtagtga ctgtactcta gtcctgtttt acaccccgtg gtgccgcttt 24 0 

tctgccagtt tggcccctca ctttaactct ctgccccggg catttccagc tcttcacttt 300 

ttggcactgg atgcatctca gcacagcagc ctttctacca ggtttggcac cgtagctgtt 360 

cctaatattt tattatttca aggagctaaa ccaatggcca gatttaatca tacagatcga 42 0 

acactggaaa cactgaaaat cttcattttt aatcagacag gtatagaagc caagaagaat 480 

gtggtggtaa ctcaagccga ccaaataggc cctcttccca gcactttgat aaaaagtgtg 540 

gactggttgc ttgtattttc cttattcttt ttaattagtt ttattatgta tgctaccatt 600 

cgaactgaga gtattcggtg gctaattcca ggacaagagc aggaacatgt ggagtagtga 660 

tggtctgaaa gaagttggaa agaggaactt caatccttcg tttcagaaat tagtgctaca 720 

gtttcataca ttttctccag tgacgtgttg acttgaaact tcaggcagat taaaagaatc 780 

atttgttgaa caactgaatg tataaaaaaa ttataaactg gtgttttaac tagtattgca 840 

ataagcaaat gcaaaaatat tcaatagatg cactattctt gtttttactg catgmacgta 900 

atccagtatt tggkaaagta atccaktttg aaatgtgrag rtgtattccg gcagaatagt 960 

gagtagaatg acagcttact atacagaagg cmaaaatagg actctcaggt aatagtttaa 102 0 

ggaaaccctt gattccttat gatgatgttt aagaaaggtt agttttctgt ttctttgcca 1080 

gttttncttc taggagtcca tagccaggga aagtatgtga accagaattg gttagtgtga 114 0 

ccccctccaa gtagccagtg ntgggaaata agggtncaat accttgatgt ttgtgatctc 1200 
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<210> 108 

<211> 75 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (75) 

<223> Xaa equals stop translation 
<400> 108 

Met Asp Pro Leu Cys Leu Pro lie lie Leu Phe Ser Ala Val Val Leu 
15 io 15 

Arg Asn Leu Phe His Leu Leu lie Leu Thr Phe His Tyr Leu Pro Leu 
20 25 30 

Phe Cys Asp Asn Pro Leu lie Leu Glu Asp Leu Ser Cys lie His Leu 
35 40 45 

Arg Val Asn lie Phe Lys Ala Lys Gin Pro Lys Phe Tyr Gly Asn Gin 
50 55 60 

Leu Gin Pro Cys Val Met Lys Ser Ser Ala Xaa 
65 70 75 



<210> 109 
<211> 202 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> SITE 
<222> (202) 

<223> Xaa equals stop translation 
<400> 109 

Met Lys Leu Leu lie Leu Phe Leu Ser His Leu Leu Ser Leu Ala Phe 
1 5 10 15 

Gly lie Leu Cys Leu Ser Val Thr Val lie Leu Ser Leu Leu Leu Ser 
20 25 30 

Phe Ser Lys Arg Gly Phe Ser Val Arg Ser Phe Gly Thr Gly Thr His 
35 40 45 

Val Lys Leu Pro Gly Pro Ala Pro Asp Lys Pro Asn Val Tyr Asp Phe 
50 55 60 

Lys Thr Thr Tyr Asp Gin Met Tyr Asn Asp Leu Leu Arg Lys Asp Lys 
65 70 75 80 



1201 
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Glu Leu Tyr Thr Gin Asn Gly lie Leu His Met Leu Asp Arg Asn Lys 
85 90 95 

Arg lie Lys Pro Arg Pro Glu Arg Phe Gin Asn Cys Lys Asp Leu Phe 
100 105 110 

Asp Leu lie Leu Thr Cys Glu Glu Arg Val Tyr Asp Gin Val Val Glu 
115 120 125 

Asp Leu Asn Ser Arg Glu Gin Glu Thr Cys Gin Pro Val His Val Val 
130 135 140 

Asn Val Asp lie Gin Asp Asn His Glu Glu Ala Thr Leu Gly Ala Phe 
145 150 155 160 

Leu lie Cys Glu Leu Cys Gin Cys lie Gin His Thr Glu Asp Met Glu 
165 170 175 

Asn Glu lie Asp Glu Leu Leu Gin Glu Phe Glu Glu Lys Ser Gly Arg 
180 185 190 

Thr Phe Leu His Thr Val Cys Phe Tyr Xaa 
195 200 



<210> 110 
<211> 371 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> SITE 
<222> (31) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<220> 

<221> SITE 
<222> (193) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<400> 110 

Met Gly Leu Lys Leu Leu Gin Lys Pro Gly Ser Leu Lys Thr Leu lie 
15 10 15 

Ala lie He Leu Val Met Tyr He Phe Met Thr He Ser Val Xaa Cys 
20 25 30 

Trp Asn Trp Lys Val Phe Pro Lys Ala Arg Phe Ala Ser Glu Tyr Gly 
35 40 45 

Tyr Gin Ser Trp Pro Ser Phe Ser Thr Leu Glu Lys Val Ser Ser Thr 
50 55 60 

Glu Asp Trp Ser Phe Asn Ser Lys Phe Ser Leu His Arg Gin His His 
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65 70 75 80 

Glu Gly Gly Asn Lys Gin Met Leu Tyr Gin Ala Gly Leu His Phe Lys 
85 90 95 

Leu Pro Gin Ser Thr Asp Pro Leu Arg Thr Phe Lys Asp Thr lie Tyr 
100 105 HO 

Leu Thr Gin Val Met Gin Ala Gin Cys Val Lys Thr Glu Thr Glu Phe 
115 120 125 

Tyr Arg Arg Ser Arg Ser Glu lie Val Asp Gin Gin Gly His Thr Met 
130 135 140 

Gly Ala Leu Tyr Trp Gin Leu Asn Asp lie Trp Gin Ala Pro Ser Trp 
145 150 155 160 

Ala Ser Leu Glu Tyr Gly Gly Lys Trp Lys Met Leu His Tyr Phe Ala 
165 170 175 

Gin Asn Phe Phe Ala Pro Leu Leu Pro Val Gly Phe Glu Asn Glu Asn 
180 185 190 

Xaa Phe Tyr lie Tyr Gly Val Ser Asp Leu His Ser Asp Tyr Ser Met 
195 200 205 

Thr Leu Ser Val Arg Val His Thr Trp Ser Ser Leu Glu Pro Val Cys 
210 215 220 

Ser Arg Val Thr Glu Arg Phe Val Met Lys Gly Gly Glu Ala Val Cys 
225 230 235 240 

Leu Tyr Glu Glu Pro Val Ser Glu Leu Leu Arg Arg Cys Gly Asn Cys 
245 250 255 

Thr Arg Glu Ser Cys Val Val Ser Phe Tyr Leu Ser Ala Asp His Glu 
260 265 270 

Leu Leu Ser Pro Thr Asn Tyr His Phe Leu Ser Ser Pro Lys Glu Ala 
275 280 285 

Val Gly Leu Cys Lys Ala Gin lie Thr Ala lie lie Ser Gin Gin Gly 
290 295 300 

Asp lie Phe Val Phe Asp Leu Glu Thr Ser Ala Val Ala Pro Phe Val 
305 310 315 320 

Trp Leu Asp Val Gly Ser lie Pro Gly Arg Phe Ser Asp Asn Gly Phe 
325 330 335 

Leu Met Thr Glu Lys Thr Arg Thr lie Leu Phe Tyr Pro Trp Glu Pro 
340 345 350 

Thr Ser Lys Asn Glu Leu Glu Gin Ser Phe His Val Thr Ser Leu Thr 
355 360 365 
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Asp lie Tyr 
370 



<210> 111 
<211> 114 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> SITE 
<222> (38) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<220> 

<221> SITE 
<222> (114) 

<223> Xaa equals stop translation 
<400> 111 

Met Arg Pro Leu Leu Leu Gly Gly Tyr Trp Val Leu Cys Leu Ser Val 
• 1 5 10 15 

Leu Gly His Ala Ala Leu Tyr His Phe Trp Leu Arg Glu Glu Gly Lys 
20 25 30 

Gly Pro Pro Gin Val Xaa Ser Val Leu Ala Leu Ala Leu Pro Ala Gly 
35 40 45 

Ser Cys Ala Pro Gly Leu Pro Phe Pro Gly Pro Leu lie Pro Thr Gin 
50 55 60 

Leu Leu Phe Ala Leu Glu Trp Gly Thr Pro Thr Pro Leu Arg Asp His 
65 70 75 80 

Pro Pro His Ser Met His Ser Ala Pro Gin Asn Pro Pro Val Phe Leu 
85 90 95 

Gly Thr His Thr Cys Pro Pro Ser Trp Tyr Phe Arg Leu lie Pro Gin 
100 105 110 

Ala Xaa 



<210> 112 
<211> 152 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (152) 

<223> Xaa equals stop translation 
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<400> 112 

Met Arg Arg Leu Leu Leu Val Thr Ser Leu Val Val Val Leu Leu Trp 
15 10 15 

Glu Ala Gly Ala Val Pro Ala Pro Lys Val Pro lie Lys Met Gin Val 
20 25 30 

Lys His Trp Pro Ser Glu Gin Asp Pro Glu Lys Ala Trp Gly Ala Arg 
35 40 45 

Val Val Glu Pro Pro Glu Lys Asp Asp Gin Leu Val Val Leu Phe Pro 
50 55 60 

Val Gin Lys Pro Lys Leu Leu Thr Thr Glu Glu Lys Pro Arg Gly Gin 
65 70 75 80 

Gly Arg Gly Pro He Leu Pro Gly Thr Lys Ala Trp Met Glu Thr Glu 
85 90 95 

Asp Thr Leu Gly Arg Val Leu Ser Pro Glu Pro Asp His Asp Ser Leu 
100 105 HO 

Tyr His Pro Pro Pro Glu Glu Asp Gin Gly Glu Glu Arg Pro Arg Leu 
115 120 125 

Trp Val Met Pro Asn His Gin Val Leu Leu Gly Pro Glu Glu Asp Gin 
130 135 140 

Asp His He Tyr His Pro Gin Xaa 
145 150 



<210> 113 
<211> 56 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (56) 

<223> Xaa equals stop translation 
<400> 113 

Met Pro Cys Gly Lys Phe Leu Phe Pro Val Ser Pro Val Ser Ser Leu 
15 io 15 

Ser Leu His Trp Ser Ala Val Leu Leu Leu Leu Leu Ala Asp Phe Pro 
20 25 30 

Arg Val His Gly Ser Pro Pro Gly Val Ser Arg Val Ser He Leu His 
35 40 45 

Cys Leu Phe Pro Phe Leu Ser Xaa 
50 55 
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<210> 114 
<211> 237 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> SITE 
<222> (237) 

<223> Xaa equals stop translation 
<400> 114 

Met Glu Val Arg Leu lie Phe Leu Ser Gly Leu Cys lie Ala Val Ala 
15 10 15 

Val Val Trp Ala Val Phe Arg Asn Glu Asp Arg Trp Ala Trp lie Leu 
20 25 30 

Gin Asp lie Leu Gly lie Ala Phe Cys Leu Asn Leu lie Lys Thr Leu 
35 40 45 

Lys Leu Pro Asn Phe Lys Ser Cys Val lie Leu Leu Gly Leu Leu Leu 
50 55 60 

Leu Tyr Asp Val Phe Phe Val Phe lie Thr Pro Phe lie Thr Lys Asn 
65 70 75 80 

Gly Glu Ser lie Met Val Glu Leu Ala Ala Gly Pro Phe Gly Asn Asn 
85 90 95 

Glu Lys Leu Pro Val Val lie Arg Val Pro Lys Leu lie Tyr Phe Ser 
100 105 110 

Val Met Ser Val Cys Leu Met Pro Val Ser lie Leu Gly Phe Gly Asp 
115 120 125 

lie lie Val Pro Gly Leu Leu lie Ala Tyr Cys Arg Arg Phe Asp Val 
130 135 140 

Gin Thr Gly Ser Ser Tyr lie Tyr Tyr Val Ser Ser Thr Val Ala Tyr 
145 150 155 160 

Ala lie Gly Met lie Leu Thr Phe Val Val Leu Val Leu Met Lys Lys 
165 170 175 

Gly Gin Pro Ala Leu Leu Tyr Leu Val Pro Cys Thr Leu lie Thr Ala 
180 185 190 

Ser Val Val Ala Trp Arg Arg Lys Glu Met Lys Lys Phe Trp Lys Gly 
195 200 205 

Asn Ser Tyr Gin Met Met Asp His Leu Asp Cys Ala Thr Asn Glu Glu 
210 215 220 



BNSDOCID: <WO 9947540A1_I_> 



WO 99/47540 



PCT/US99/05804 



69 



Asn Pro Val lie Ser Gly Glu Gin lie Val Gin Gin Xaa 
225 230 235 



<210> 115 

<211> 44 

<212> PRT 

<213> Homo sapiens 



<220> 

<221> SITE 
<222> (44) 

<223> Xaa equals stop translation 



<400> 115 
Met Phe Cys Phe 
1 

Leu Asn Pro Leu 
20 

Val Phe Leu Phe 

35 



Tyr Leu His Phe 
5 

Leu Phe Phe Ser 

Pro Asp Tyr His 
40 



He Phe His Val 
10 

Cys Ser Cys Phe 
25 

Leu Gly Met Xaa 



Leu Ser Tyr Lys 
15 

Cys Phe He Leu 
30 



<210> 116 

<211> 65 

<212> PRT 

<213> Homo sapiens 



<220> 

<221> SITE 

<222> (65) 

<223> Xaa equals stop translation 

<400> 116 

Met Val Arg His He Arg Glu Arg Arg Arg Gin Pro Leu Ala Phe Gin 
1 5 10 15 

Arg Val Leu Leu Ser Leu Cys Leu Leu Glu Gly He Trp His Ser Pro 
20 25 30 

Ala Ala Ala Ala Gly Gly Gly Ser His Cys Ser Ser Trp Pro Ser Leu 
35 40 45 

Tyr Thr Thr Phe Gin Arg Val Ser Leu Leu Glu Leu Asp Leu Gly Leu 
50 55 60 



Xaa 
65 



<210> 117 
<211> 118 
<212> PRT 
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<213> Homo sapiens 
<220> 

<221> SITE 
<222> (118) 

<223> Xaa equals stop translation 
<400> 117 

Met Ala Arg Ser Ala Leu Arg Leu Glu lie Leu Gly Gin Leu Leu Val 
1 5 10 15 

Gly Val Ser Ser Cys Cys Ala Glu lie Arg Ser Arg Ser Tyr Leu Gly 
20 25 30 

Phe Cys Trp Lys Asn lie Gin Asp Glu Arg Lys Lys Lys lie lie Leu 
35 40 45 

Arg Gly Ser Arg Asn Leu Leu Cys Pro Arg Leu Leu Arg Pro Leu Glu 
50 55 60 

Pro Val Gin Ala Lys Gly Thr Gin Ser Val Asp Pro Arg Glu Val Val 
65 70 75 80 

Arg Glu Thr Arg Ser Met Ser Thr Leu Pro Ala Asp Phe Cys Leu Leu 
85 90 95 

Pro Gin Ala Ser Arg Met Ala Gin Lys Gly Ser Pro Ser Arg Ser Ser 
100 105 110 

Leu Gin Leu Leu Phe Xaa 
115 



<210> 118 

<211> 65 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (65) 

<223> Xaa equals stop translation 
<400> 118 

Met Thr Val Ser Leu Phe Leu Leu Leu Ala Thr Ser Gin Ser Gin Asp 
15 10 15 

Gly Cys Cys Asp Ser Gly Ser Cys Pro Asn Ser Arg Gin Gin Glu Gly 
20 25 30 

His Gly Ala Ala Pro Ala Ser Arg Cys Pro Cys Arg Pro Ser Leu Gin 
35 40 45 

Ala Gin Glu Pro Lys Glu Glu Ser Thr Gin Met Trp Cys Ser His Leu 
50 55 60 
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Xaa 
65 



<210> 119 

<211> 43 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (43) 

<223> Xaa equals stop translation 
<400> 119 

Met Leu Lys Trp Thr Gly Phe Leu Val Val Leu Val Ala Phe Lys Lys 
1 .5 10 15 

lie Ser Ala Ser Phe Gin Val Asn Tyr Asn Leu Lys Phe Glu lie Ser 
20 25 30 

Phe Gly Glu Pro Trp Lys Phe Thr Gin Trp Xaa 
35 40 



<210> 120 

<211> 48 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (48) 

<223> Xaa equals stop translation 
<400> 120 

Met Ser Phe Gly lie Ser lie His Thr Cys Thr Tyr Leu lie Phe lie 
15 10 15 

Ala Phe His Phe lie Ala Leu Cys Lys Val Thr Phe Phe Thr Asp Ser 
20 25 30 

Arg Phe Gly Asn Pro Met Ser He Ser Leu Ser Ala Pro Phe Phe Xaa 
35 40 45 



<210> 121 
<211> 140 
<212> PRT 
<213> Homo sapiens 
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<220> 

<221> SITE 
<222> (140) 

<223> Xaa equals stop translation 
<400> 121 

Met Ala Leu Gly lie Gin Lys Arg Phe Ser Pro Glu Val Leu Gly Leu 
1 5 10 15 

Cys Ala Ser Thr Ala Leu Val Trp Val Val Met Glu Val Leu Ala Leu 
20 25 30 

Leu Leu Gly Leu Tyr Leu Ala Thr Val Arg Ser Asp Leu Ser Thr Phe 
35 40 45 

His Leu Leu Ala Tyr Ser Gly Tyr Lys Tyr Val Gly Met lie Leu Ser 
50 55 60 

Val Leu Thr Gly Leu Leu Phe Gly Ser Asp Gly Tyr Tyr Val Ala Leu 
65 70 75 80 

Ala Trp Thr Ser Ser Ala Leu Met Tyr Phe lie Val Arg Ser Leu Arg 
85 90 95 

Thr Ala Ala Leu Gly Pro Asp Ser Met Gly Gly Pro Val Pro Arg Gin 
100 105 110 

Arg Leu Gin Leu Tyr Leu Thr Leu Gly Ala Ala Ala Phe Gin Pro Leu 
115 120 125 

lie lie Tyr Trp Leu Thr Phe His Leu Val Arg Xaa 
130 135 140 



<210> 122 

<211> 92 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (89) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<220> 

<221> SITE 
<222> (92) 

<223> Xaa equals stop translation 
<400> 122 

Met Met Asp Phe Leu Arg Cys Val Thr Ala Ala Leu lie Tyr Phe Ala 
15 10 15 

lie Ser lie Thr Ala lie Ala Lys Tyr Ser Asp Gly Ala Ser Lys Ala 
20 25 30 
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Ala Gly Gly Ser Val Pro Asp Thr 
35 40 

Glu Met Gly Arg Glu Leu Gly Ala 
50 55 

Ser Pro Val Met His Pro lie His 
65 70 

Leu Leu Pro Ser Cys Leu Gin Leu 
85 



Arg Ala Val Cys Pro Ser Arg Ser 
45 

Ala Ala Ser Arg Glu Gin Gly Val 
60 

Pro Val His Arg Cys Leu Ala Ser 
75 80 

Xaa Ser Thr Xaa 
90 



<210> 123 
<211> 347 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (242) 

<22 3> Xaa equals any of the naturally occurring L-amino acids 
<220> 

<221> SITE 
<222> (246) 

<22 3> Xaa equals any of the naturally occurring L-amino acids 
<220> 

<221> SITE 
<222> (347) 

<223> Xaa equals stop translation 
<400> 123 

Met Arg Arg Gly Ala Gly Ala Ala Arg Gly Arg Ala Ser Trp Cys Trp 
1 5 10 15 

Ala Leu Ala Leu Leu Trp Leu Ala Val Val Pro Gly Trp Ser Arg Val 
20 25 30 

Ser Gly He Pro Ser Arg Arg His Trp Pro Val Pro Tyr Lys Arg Phe 
35 40 45 

Asp Phe Arg Pro Lys Pro Asp Pro Tyr Cys Gin Ala Lys Tyr Thr Phe 
50 55 60 

Cys Pro Thr Gly Ser Pro He Pro Val Met Glu Gly Asp Asp Asp He 
65 70 75 80 

Glu Val Phe Arg Leu Gin Ala Pro Val Trp Glu Phe Lys Tyr Gly Asp 
85 90 95 

Leu Leu Gly His Leu Lys He Met His Asp Ala He Gly Phe Arg Ser 
100 105 no 
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Thr Leu Thr Gly Lys 
115 

Leu Gly Asn Cys Thr 
130 

Phe Trp Cys Asm Gin 
145 

Val His Trp Lys Glu 
165 

Gly Asn Met Phe Asn 
180 

Thr Gly lie Tyr Tyr 
195 

Gly Ala Glu Thr Trp 
210 

Arg Thr Phe Asn Lys 
225 

Glu Xaa Asn Tyr Thr 
245 

Leu Gly Asn Glu Thr 
260 

Gly Leu Ala lie Lys 
275 

Thr Lys Glu Phe Leu 
290 

Val His Lys Gin Phe 
305 

Pro Met Lys Phe Pro 
325 

Pro lie Arg Asn Lys 
340 



Asn Tyr Thr Met Glu 
120 

Phe Pro His Leu Arg 
135 

Gly Ala Ala Cys Phe 
150 

Asn Gly Thr Leu Val 
170 

Gin Met Ala Lys Trp 
185 

Glu Thr Trp Asn Val 
200 

Phe Asp Ser Tyr Asp 
215 

Leu Ala Glu Phe Gly 
230 

Xaa lie Phe Leu Tyr 
250 

Ser Val Phe Gly Pro 
265 

Arg Phe Tyr Tyr Pro 
280 

Leu Ser Leu Leu Gin 
295 

Tyr Leu Phe Tyr Asn 
310 

Phe lie Lys lie Thr 
330 

Thr Leu Ser Gly Leu 
345 



Trp Tyr Glu Leu Phe Gin 
125 

Pro Glu Met Asp Ala Pro 
140 

Phe Glu Gly lie Asp Asp 

155 160 

Gin Val Ala Thr lie Ser 
175 

Val Lys Gin Asp Asn Glu 
190 

Lys Ala Ser Pro Glu Lys 
205 

Cys Ser Lys Phe Val Leu 
220 

Ala Glu Phe Lys Asn lie 
235 240 

Ser Gly Glu Pro Thr Tyr 
255 

Thr Gly Asn Lys Thr Leu 
270 

Phe Lys Pro His Leu Pro 
285 

lie Phe Asp Ala Val lie 
300 

Phe Glu Tyr Trp Phe Leu 
315 320 

Tyr Glu Glu lie Pro Leu 
335 

Xaa 



<210> 124 
<211> 234 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (173) 
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<223> Xaa equals any of the naturally occurring L-amino acids 
<220> 

<221> SITE 
<222> (234) 

<223> Xaa equals stop translation 
<400> 124 

Met His Arg Gly Lys Leu Asp Cys Ala Gly Gly Ala Leu Leu Ser Ser 
1 5 10 15 

Tyr Leu lie Val Leu Met lie Leu Leu Ala Val Val lie Cys Thr Val 
20 25 30 

Ser Ala He Met Cys Val Ser Met Arg Gly Thr He Cys Asn Pro Gly 
35 40 45 

Pro Arg Lys Ser Met Ser Lys Leu Leu Tyr He Arg Leu Ala Leu Phe 
50 55 60 

Phe Pro Glu Met Val Trp Ala Ser Leu Gly Ala Ala Trp Val Ala Asp 
65 70 75 80 

Gly Val Gin Cys Asp Arg Thr Val Val Asn Gly He He Ala Thr Val 
85 90 95 

Val Val Ser Trp He He He Ala Ala Thr Val Val Ser He He He 
100 105 no 

Val Phe Asp Pro Leu Gly Gly Lys Met Ala Pro Tyr Ser Ser Ala Gly 
115 120 125 

Pro Ser His Leu Asp Ser His Asp Ser Ser Gin Leu Leu Asn Gly Leu 
130 135 140 

Lys Thr Ala Ala Thr Ser Val Trp Glu Thr Arg He Lys Leu Leu Cys 
I 45 150 155 160 

Cys Cys He Gly Lys Asp Asp His Thr Arg Val Ala Xaa Ser Ser Thr 
165 170 175 

Ala Glu Leu Phe Ser Thr Tyr Phe Ser Asp Thr Asp Leu Val Pro Ser 
180 185 190 

Asp He Ala Ala Gly Leu Ala Leu Leu His Gin Gin Gin Asp Asn He 
195 200 205 

Arg Asn Asn Gin Asp Leu Pro Arg Trp Ser Ala Met Pro Gin Gly Ala 
210 215 220 

Pro Arg Lys Leu He Trp Met Gin Asn Xaa 
225 230 



<210> 125 
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<211> 54 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (54) 

<223> Xaa equals stop translation 
<400> 125 

Met Gin Gly Val Leu Phe Gly Phe Val Trp Leu Phe Ser Phe Leu Trp 
15 10 15 

Gin Glu Asn Lys Ser Ser Ala Ser Pro Ser Thr Leu Ala Lys Ser Gly 
20 25 30 

Ser Pro Cys Pro Val Ser lie Pro Trp Met Pro Gly Val Leu Val Arg 
35 40 45 

Phe Phe Thr Leu Leu Xaa 
50 



<210> 126 

<211> 82 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (44) 

<22 3> Xaa equals any of the naturally occurring L-amino acids 
<220> 

<221> SITE 
<222> (82) 

<223> Xaa equals stop translation 
<400> 126 

Met Arg Met Arg Val Ala Val Ala Pro Arg Pro His Gin His Leu Val 
15 10 15 

Val Ser Val Ser Trp lie Leu Ala lie Leu lie Ser Val Ser Gly Tyr 
20 25 30 

His Cys Phe His Leu Gin Phe Ser Tyr Met Val Xaa Asn lie Phe Pro 
35 40 45 

His Val Tyr Leu Ser Ser Ala Tyr Leu Leu Arg Pro Val lie Cys Ser 
50 55 60 

Asp Leu Leu Pro Val Phe Val Cys Leu His Val Cys Leu Cys Leu lie 
65 70 75 80 

Phe Xaa 
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<210> 127 
<211> 42 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> SITE 
<222> (42) 

<223> Xaa equals stop translation 



<400> 127 
Met Gly Trp Glu 
1 

Pro Trp Cys Thr 
20 

Gly Leu Glu Arg 
35 



Ala Ala Leu Ala 
5 

lie Gin Arg Pro 

Arg Ser Lys Gly 
40 



Leu Leu Val Ser 
10 

Asp Val Gly Thr 
25 

Phe Xaa 



Ala Val Phe Phe 
15 

Thr Ser Pro Gly 
30 



<210> 128 

<211> 66 

<212> PRT 

<213> Homo sapiens 



<220> 

<221> SITE 
<222> (66) 

<223> Xaa equals stop translation 



<400> 128 
Met Thr Phe Met 
1 

Asn Arg Leu lie 
20 

His Asn Gly Trp 
35 

Tyr Phe Ser Leu 
50 



lie Leu Lys Phe 
5 

Ala Arg Gin Leu 

lie Pro Lys Ser 
40 

lie Pro Thr Gly 
55 



Phe Phe Leu Cys 
10 

Ala Lys lie His 
25 

Asn Leu Trp Leu 



Phe Ala Asp Glu 
60 



Gly Phe Val Leu 
15 

Ala He His Ala 
30 

Lys Met Gly Lys 
45 

Asp He Asn Lys 



Arg Xaa 
65 



<210> 129 
<211> 50 
<212> PRT 

<213> Homo sapiens 
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<220> 

<221> SITE 
<222> (50) 

<223> Xaa equals stop translation 



<400> 129 

Met lie Val Asn His Phe Ser Phe 
1 5 

Phe Leu Leu Gin His Ser Cys Phe 
20 

Asp Ser Leu Cys His Cys Phe Leu 
35 40 

Gin Xaa 
50 



Leu Phe Cys Trp lie Val Phe Cys 
10 15 

Cys Cys Ala Tyr Phe Trp Ser Phe 
25 30 

Ser His Thr Pro Leu Arg Phe Thr 
45 



<210> 130 
<211> 227 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> SITE 
<222> (227) 

<223> Xaa equals stop translation 
<400> 130 

Met Glu Thr Val Val lie Val Ala lie Gly Val Leu Ala Thr lie Phe 
15 10 15 

Leu Ala Ser Phe Ala Ala Leu Val Leu Val Cys Arg Gin Arg Tyr Cys 
20 25 30 

Arg Pro Arg Asp Leu Leu Gin Arg Tyr Asp Ser Lys Pro lie Val Asp 
35 40 45 

Leu lie Gly Ala Met Glu Thr Gin Ser Glu Pro Ser Glu Leu Glu Leu 
50 55 60 

Asp Asp Val Val lie Thr Asn Pro His lie Glu Ala lie Leu Glu Asn 
65 70 75 80 

Glu Asp Trp lie Glu Asp Ala Ser Gly Leu Met Ser His Cys lie Ala 
85 90 95 

lie Leu Lys lie Cys His Thr Leu Thr Glu Lys Leu Val Ala Met Thr 
100 105 110 

Met Gly Ser Gly Ala Lys Met Lys Thr Ser Ala Ser Val Ser Asp lie 
115 120 125 
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He Val Val Ala 
130 

Ser Met Tyr Pro 
145 

Ala Leu Leu Leu 



Cys His Leu Thr 
180 

Ala Glu Glu His 
195 

Pro Asp Lys Gly 
210 

Ala He Xaa 
225 



Lys Arg lie Ser 
135 

Pro Leu Asp Pro 
150 

Ser Val Ser His 
165 

Gly Gly Leu Asp 



Leu Glu Val Leu 
200 

Leu Pro Gly Pro 
215 



Pro Arg Val Asp 
140 

Lys Leu Leu Asp 
155 

Leu Val Leu Val 
170 

Trp He Asp Gin 
185 

Arg Glu Ala Ala 



Glu Gly Phe Leu 
220 



Asp Val Val Lys 



Ala Arg Thr Thr 
160 

Thr Arg Asn Ala 
175 

Ser Leu Ser Ala 
190 

Leu Ala Ser Glu 
205 

Gin Glu Gin Ser 



<210> 131 
<211> 118 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (118) 

<223> Xaa equals stop translation 
<400> 131 

Met Gin Arg lie Ala Ser Leu Leu Thr Leu Leu Thr Gin Leu Thr Leu 
15 10 15 

Ala Ala Gly Ser Thr Pro Ala Glu Thr He Ser Asp Ser Ala Glu Ala 
20 25 30 



Ser Leu Ser Ala Thr Pro Ser Leu 
35 40 

Leu Gin Pro Leu Val Glu Pro Cys 
50 55 

Ser Arg Pro Glu Met Trp Arg Ala 
65 70 

Leu Leu Phe Leu Gly Ala Tyr Tyr 
85 

Ser Cys Pro Glu Asp Trp Leu Gin 
100 

Cys Cys Cys His Cys Xaa 



Val Thr Trp Thr Gin Val Ser Gly 
45 

Leu Arg Gin Thr Leu Lys Leu Leu 
60 

Val Gly Pro Val Pro Val Ala Cys 
75 80 

Gin Ala Trp Ser Gin Gin Pro Ser 
90 95 

Asp Met Glu Arg Leu Ser Glu Ser 
105 110 
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115 



<210> 132 
<211> 306 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> SITE 
<222> (180) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<220> 

<221> SITE 
<222> (197) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<220> 

<221> SITE 
<222> (306) 

<223> Xaa equals stop translation 
<400> 132 

Met Ser Glu Asp Arg Pro Met Leu Gin Phe Leu Leu His Thr Ser Phe 
15 10 15 

Leu Ser Pro Leu Phe lie Leu Trp Leu Trp Thr Lys Pro lie Ala Arg 
20 25 30 

Asp Phe Leu His Gin Pro Pro Phe Gly Glu Thr Arg Phe Ser Leu Leu 
35 40 45 

Ser Asp Ser Ala Phe Asp Ser Gly Arg Leu Trp Leu Leu Val Val Leu 
50 55 60 

Cys Leu Leu Arg Leu Ala Val Thr Arg Pro His Leu Gin Ala Tyr Leu 
65 70 75 80 

Cys Leu Ala Lys Ala Arg Val Glu Gin Leu Arg Arg Glu Ala Gly Arg 
85 90 95 

lie Glu Ala Arg Glu lie Gin Gin Arg Val Val Arg Val Tyr Cys Tyr 
100 105 110 

Val Thr Val Val Ser Leu Gin Tyr Leu Thr Pro Leu lie Leu Thr Leu 
115 120 125 

Asn Cys Thr Leu Leu Leu Lys Thr Leu Gly Gly Tyr Ser Trp Gly Leu 
130 135 140 

Gly Pro Ala Pro Leu Leu Ser Pro Arg Pro lie Leu Ser Gin Arg Cys 
145 150 155 160 

Pro His Arg Leu Trp Gly Gly Arg Ser Pro Ala Asp Cys Ser Ala Asp 
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Cys Arg Gly Xaa 
180 

Pro Gly Leu Pro 
195 

Pro Phe Arg Pro 
210 

Cys Arg Pro Ser 
225 

Leu Ala Cys Pro 



Gly Pro Asp Ser 
260 

Pro Pro Trp Thr 
275 

Ser Ser Met Arg 
290 

Val Xaa 
305 



165 

Gly Trp Pro Ala 



Xaa Leu Val Asp 
200 

Leu Leu Pro Pro 
215 

Trp Gly Pro Glu 
230 

Leu Cys Leu Arg 
245 

Pro Ala Phe Pro 



Pro Ser Phe Cys 
280 

Val Pro Arg Pro 
295 



170 

Tyr Ser Pro Leu 
185 

Gly Cys Leu Pro 



Ala Leu Gly Arg 
220 

Val Cys Ser Trp 
235 

Pro Arg Val Pro 
250 

Ser Pro Gin Cys 
265 

Leu Arg Thr Val 



Leu Ser Pro Lys 
300 



175 

Pro Pro Trp Arg 
190 

Ala Ala Arg Gin 
205 

Leu Leu Ala Ala 



Gly Ser Gly Thr 
240 

Ser Cys Lys Val 
255 

Leu Thr Arg Gly 
270 

Ser Pro Gly Pro 
285 

Arg Met Cys Gin 



<210> 133 
<211> 45 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (45) 

<223> Xaa ecjuals stop translation 
<400> 133 

Met Ser Tyr Ser Leu Phe Leu Ala Leu Leu Ser Phe Ala Ser Ala lie 
15 10 15 

Leu Phe Val Ala Gly Thr lie Ala Gly Thr Gly Gly Leu Ser Phe His 
20 25 30 

Gly lie Ala Thr lie Phe Val Leu Thr Gly Lys Trp Xaa 
35 40 45 



<210> 134 
<211> 44 
<212> PRT 

<213> Homo sapiens 
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<220> 

<221> SITE 
<222> (6) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<220> 

<221> SITE 
<222> (44) 

<223> Xaa equals stop translation 
<400> 134 

Met Gly Arg Leu Gly Xaa Gin Cys Leu Leu Phe Leu Ala Phe Lys Ala 
15 10 15 

lie Ser Gly Val Phe Phe Leu Phe Trp Arg Pro Ala Asp Ser Thr Glu 
20 25 30 

Arg Asn Thr Gin Ser Trp Asp Phe Pro Pro Leu Xaa 
35 40 



<210> 135 

<211> 50 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (50) 

<223> Xaa equals stop translation 
<400> 135 

Met Gly Val Gly Val Leu Arg lie Leu Leu Ser Cys Leu Gly Glu Ala 
15 10 15 

Ala Pro Lys Ser Ala Gly Thr Ser Leu Glu Ser Ala Lys Glu Cys Trp 
20 25 30 

Ser Ala Ala Thr Leu Leu Val Leu Cys Val Leu Cys Gin Leu Gin His 
35 40 45 

Gly Xaa 
50 



<210> 136 

<211> 81 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (81) 

<223> Xaa equals stop translation 
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<400> 136 
Met Glu Ser Leu 
1 

Ser Leu Leu Ala 
20 

Asn Ser Gin Phe 
35 

lie Ala Gin Val 
50 

Arg Val Leu Gin 
65 

Xaa 



Pro Glu Asn Lys 
5 

lie lie Gly Leu 

Gly Leu Val Asp 
40 

Leu Leu Leu Asp 
55 

Phe Phe Leu Gly 
70 



Pro Leu Val Trp 
10 

Leu Leu Gly Ser 
25 

He Pro Val Glu 



Phe Cys Leu Ala 
60 

Thr Pro Lys Leu 
75 



Ser Leu Ala Val 
15 

Ser Pro Asp Phe 
30 

Phe Lys Leu Val 
45 

Leu Leu Ala Asp 



Lys Val Pro Ser 
80 



<210> 137 
<211> 277 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (94) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<220> 

<221> SITE 
<222> (103) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<220> 

<221> SITE 
<222> (277) 

<223> Xaa equals stop translation 



<400> 137 

Met He His Val Asn Arg Asn He 
1 5 

Phe Val Ala Gly Val Phe Leu Phe 
20 

Lys Pro Tyr Phe Leu Leu Leu Leu 

35 40 

Asp He Val Phe Val Leu Leu Leu 
50 55 

Ala Pro Phe Gly Ala Leu Met Val 



Met Asp Phe Lys Leu Phe Leu Val 
10 15 

Phe Tyr Ala Arg Thr Leu Glu Ser 
25 30 

Gly Asn Cys Ala Arg Cys Ser Asn 
45 

Val Lys Arg Phe He Arg Ser He 
60 

Gly Cys Trp Phe Ala Ser Val Tyr 
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65 70 75 80 

lie Val Cys Gin Leu Met Glu Asp Leu Lys Trp Leu Trp Xaa Glu Asn 
85 90 95 

Arg lie Tyr Val Ser Gly Xaa Val Leu lie Val Gly Phe Phe Ser Phe 
100 105 HO 

Val Val Cys Tyr Lys His Gly Pro Leu Ala His Asp Arg Ser Arg Ser 
115 120 125 

Leu Leu Met Trp Met Leu Arg Leu Leu Ser Leu Val Leu Val Tyr Ala 
130 135 140 

Gly Val Ala Val Pro Gin Phe Ala Tyr Ala Ala lie lie Leu Leu Met 
145 150 155 160 

Ser Ser Trp Ser Leu His Tyr Pro Leu Arg Ala Cys Ser Tyr Met Arg 
165 170 175 

Trp Lys Met Glu Gin Trp Phe Thr Ser Lys Glu Leu Val Val Lys Tyr 
180 185 190 

Leu Thr Glu Asp Glu Tyr Arg Glu Gin Ala Asp Ala Glu Thr Asn Ser 
195 200 205 

Ala Leu Glu Glu Leu Arg Arg Ala Cys Arg Lys Pro Asp Phe Pro Ser 
210 215 220 

Trp Leu Val Val Ser Arg Leu His Thr Pro Ser Lys Phe Ala Asp Phe 
225 230 235 240 

Val Leu Gly Gly Ser His Leu Ser Pro Glu Glu lie Ser Leu His Glu 
245 250 255 

Glu Gin Tyr Gly Leu Gly Gly Ala Phe Leu Glu Glu Gin Leu Phe Asn 
260 265 270 

Pro Ser Thr Ala Xaa 
275 



<210> 138 

<211> 57 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (57) 

<223> Xaa equals stop translation 
<400> 138 

Met Cys Gin Thr Leu Pro Ala Arg Leu Arg Ala Gin Cys lie Ser Ser 
15 10 15 
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Leu Leu Phe Leu Leu Met Gly Leu Leu Ala Met Thr Gly Glu Arg Asn 
20 25 30 

Gin Gly Thr His Tyr Tyr Glu Phe Ser Gly Phe lie Phe Lys Ser Gin 
35 40 45 

Met Met Trp Ser lie Lys Pro Asn Xaa 
50 55 



<210> 139 
<211> 71 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (71) 

<223> Xaa equals stop translation 
<400> 139 

Met Tyr Leu Trp Phe Ser Phe Ser Thr Val Gly Leu Cys Gly Cys Cys 
15 10 15 

Leu Leu Tyr Arg Ala Cys Gly Phe He Trp Tyr Leu Leu Leu Leu Gly 
20 25 30 

His Ser Ser Thr Asn Ser Leu Gin Asp Gly Gly Ala Glu Arg Pro Glu 
35 40 45 

His Pro Trp Ala His Val Arg Tyr Ser Cys Arg Arg Glu Leu Ser Phe 
50 55 60 

Trp Phe Tyr Val Phe Asn Xaa 
65 70 



<210> 140 

<211> 63 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (63) 

<223> Xaa equals stop translation 
<400> 140 

Met Glu Pro Glu Ser Trp Ala Leu Cys Leu Leu Leu Phe Leu Gly Thr 
1 5 10 15 



Ala Leu Gly Tyr Pro Pro Leu Pro Arg His Ser Ser Lys Cys Glu He 
20 25 30 
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Leu Glu Val Arg Leu His Leu Leu Pro Leu Leu He Asn He Gly Met 

35 40 45 

Met Ser Pro Val Ala Ser Pro Phe Val Cys Ser He Thr Gly Xaa 
50 55 60 



<210> 141 
<211> 89 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (89) 

<223> Xaa equals stop translation 
<400> 141 

Met Leu Phe Leu Ser Ala Ser He Cys Thr Ser Ala Leu Phe Leu Cys 
.1 5 10 15 

Leu Ser Arg Leu Thr He Ser Ala Pro His Pro Ala Trp Trp Gly Arg 
20 25 30 

Met Pro Thr His Thr Ser Pro Gly His Leu Leu Glu Leu Gin Pro Arg 
35 40 45 

Gly Met Thr Glu Ser He Leu Phe Ser He Ser Ala Leu Val Ser Asn 
50 55 60 

Ser Trp Gly Lys Met Thr Gin Leu Thr Ser Gly Ser His Ser Trp Ser 
65 70 75 80 

Ser Gly Leu Gin Asn Phe Gin Ala Xaa 
85 



<210> 142 
<211> 46 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (46) 

<223> Xaa equals stop translation 
<400> 142 

Met Arg Pro Val Cys Ser Leu Gly Trp Ala Gly Trp Pro Gly Leu Val 
15 10 15 

Cys Gly Leu Arg Ala Leu Leu Gly Pro Ser Leu Phe Pro Val Thr Phe 
20 25 30 

Gly Ala Thr Glu Ala Val His Ser Leu Asp Val Cys Ser Xaa 
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35 40 45 



<210> 143 
<211> 56 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (56) 

<223> Xaa equals stop translation 
<400> 143 

Met Val Asn Glu Lys Glu Ala Arg Thr Gly Ser Pro Lys Ser Trp Leu 
15 10 15 

Leu Cys Leu Ala Leu Leu Leu He Lys Tyr Val Thr Phe Cys Lys Pro 
20 25 30 

Tyr Leu Thr Lys Pro Tyr Phe Leu His Leu Ser Val Leu Asp Gin Leu 
35 40 45 

Ser Pro Gly Thr Pro Leu Asp Xaa 
50 55 



<210> 144 
<211> 77 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (77) 

<223> Xaa equals stop translation 
<400> 144 

Met Phe He Ala He Tyr Phe Lys Ala Phe His Gly Ser Phe Gin Leu 
1 5 10 15 

Cys Thr Trp Leu Val He Met He Val He Leu Gly Gin Ser Phe Ser 
20 25 30 

Ala Leu Ala Leu Leu Thr Phe Trp Leu He Leu Cys Cys Arg Gly Cys 
35 40 45 

Pro Val His Cys Arg Val Phe Ser Ser lie Pro Asp Leu Tyr Leu Leu 
50 55 60 

Asn Ala Arg Ser Asn . Thr Val Pro Pro Ala Gin Leu Xaa 
65 70 75 



<210> 145 
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<211> 43 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (43) 

<223> Xaa equals stop translation 
<400> 145 

Met Phe Phe Leu Ser Met Phe Leu His lie Val Leu Leu His Cys Gly 
15 10 15 

Asn Ser Phe Tyr Lys lie Cys His Ser Trp Asp Tyr Ala Ala Leu Gin 
20 25 30 

Glu Ser Thr Arg Phe Tyr Ser Asn Ser Tyr Xaa 
35 40 



<210> 146 
<211> 102 
<212> PRT 
<213> Homo sapiens 

<220> ■ 
<221> SITE 
<222> (67) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<220> 

<221> SITE 
<222> (102) 

<223> Xaa equals stop translation 
<400> 146 

Met Glu Leu Glu Arg Cys Ser Val Val Leu Cys lie Leu Ala Asn Leu 
15 10 15 

Ala Val Leu Arg Ala Leu Phe Leu Pro Cys lie lie Phe His Cys Val 
20 25 30 

Ser Asp Ser Arg Ser Val Asn Arg Glu Thr Lys Val Lys Phe .Val His 
35 40 45 

Thr Ser Val His Gly Val Gly His Ser Phe Val Gin Ser Ala Phe Lys 
50 55 60 

Ala Phe Xaa Leu Val Pro Pro Glu Ala Val Pro Glu Gin Lys Asp Pro 
65 70 75 80 

Asp Pro Glu Phe Pro Thr Val Lys Tyr Pro Asn Pro Glu Glu Gly Lys 
85 90 95 

Gly Val Leu Val Thr Xaa 
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100 



<210> 147 
<211> 134 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (134) 

<223> Xaa equals stop translation 
<400> 147 

Met Arg Val Pro Leu Val Leu Ser Trp Ala Phe Val Leu Val Gly Phe 
15 10 15 

Ser Gly Val Tyr Leu Ala Ser Glu Ser Phe Trp Phe Pro Pro Ser Leu 
20 25 30 

Cys Asp Leu Thr Ser Pro Pro Gly Leu His Leu Trp Lys Phe lie Arg 
35 40 45 

Asp Leu Val Ser Met Glu Glu Leu Thr Asp Ser Ala Arg Glu Met Gly 
50 55 60 

Tyr Trp Met Met Val Phe Ser Leu Lys Ala Met Phe Pro Val Ser Ser 
65 70 75 80 

Gly Cys Phe Gin Glu Arg Gin Glu Thr Asn Lys Ser Leu Thr Leu Leu 
85 90 95 

Arg Cys Ser Gin Arg Asp Thr Ser Pro Leu Met Asp Gly Gin Thr Trp 
100 105 110 

Ala Arg Val Arg Val Thr Lys Pro Pro Thr Thr Ala Thr Ala Ala Tyr 
115 120 125 

Asn Arg His lie Arg Xaa 
130 



<210> 148 
<211> 50 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (50) 

<223> Xaa equals stop translation 
<400> 148 

Met Lys Ser Leu Phe Cys lie Tyr Phe Leu Arg Trp Pro Met Gly Leu 
15 10 15 
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Ser Trp Gly Glu Thr Phe lie Leu Leu Arg Asp Ser Leu Ala lie Asn 
20 25 30 

Phe Gin Ser Phe Ser Lys Ala Ala Ser Gly Asp lie Phe Gly Cys His 
35 40 45 

Asp Xaa 
50 



<210> 149 

<211> 64 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (6) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<220> 

<221> SITE 
<222> (64) 

<223> Xaa equals stop translation 
<400> 149 

Met Ser Cys Gly Leu Xaa Phe Gly Pro Trp Phe Val Pro Met Leu Leu 
15 10 15 

Met Ser His Ser Leu Leu Pro Ser Trp Ser Gly Leu Trp Val Thr Thr 
20 25 30 

Trp Asn Gly Ser Ser Gly Glu Arg Thr Pro Ser Pro Trp Arg Arg Lys 
35 40 45 

Arg Ala Ser Gin Ser Ala Gly Arg lie Ala Ser Trp Met Ser Phe Xaa 
50 55 60 



<210> 150 

<211> 75 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (59) 

<22 3> Xaa equals any of the naturally occurring L-amino acids 
<220> 

<221> SITE 
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<222> (75) 

<223> Xaa equals stop translation 
<400> 150 

Met Leu Ser Ser Pro Asn Leu Ala Ala Ser Leu Leu Cys Leu Trp His 
1 5 10 15 

Ser Gly Ser Ala Thr Asn Trp Ala Pro Pro Cys Ala Gly Met Trp Ala 
20 25 30 

Ser Arg Cys Gly Trp Lys Val Ser Pro His Pro Glu Ala Gly Pro Cys 
35 40 45 

Ser Ser Ala Leu Trp Val Ser Cys Cys Val Xaa Ala Glu Gin Pro Gin 
50 55 60 

Pro Gly Gly Arg Glu Pro Arg His Arg Gly Xaa 
65 70 75 



<210> 151 
<211> 55 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (55) 

<223> Xaa equals stop translation 
<400> 151 

Met Pro His lie Ser Phe Cys Leu Gly Thr Pro Tyr Val Val Ala Val 
15 10 15 

Tyr Leu Pro Ala Trp lie Val Met Leu Leu Leu Pro Gly Val Arg Pro 
20 25 30 

Tyr Ser Ser Leu Gin Ala Leu Lys His Pro Ser Cys Ser Ser Ser Ser 
35 40 45 

Val Cys Ala Pro Tyr Met Xaa 
50 55 



<210> 152 
<211> 58 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (58) 

<223> Xaa equals stop translation 
<400> 152 



BNSDOCID: <WO 9947540A1_I_> 



WO 99/47540 



PCT/US99/05804 



92 



Met Gly Leu Asn 
1 

Ala lie Ser Ala 
20 

Phe Leu lie Ser 
35 

Arg Gly Val Trp 
50 



lie Ser Pro Trp 
5 

Ala Phe lie Ser 



His Arg Ser Ser 
40 

Glu Asn Glu Glu 
55 



Cys Phe Leu Ala 
10 

Val Gly Val Val 
25 

Lys Asn Leu Arg 
lie Xaa 



lie Leu Thr Cys 
15 

Cys Trp Leu Leu 
30 

Lys Ser Arg Val 
45 



<210> 153 

<211> 53 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (53) 

<223> Xaa equals stop translation 
<400> 153 

Met Ala Tyr Val Leu Ala Val Leu Cys Phe Lys Ser Leu Trp Ala Leu 
15 10 15 

Phe Lys Pro Asn Lys Gin Leu lie Glu Phe Leu Leu Met Val Lys Val 
20 25 30 

Val Lys lie Pro Leu Cys Tyr Leu Arg Gin Leu Leu Gly Gly lie Lys 
35 40 45 

Thr Pro Arg Val Xaa 
50 



<210> 154 

<211> 51 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (51) 

<223> Xaa equals stop translation 
<400> 154 

Met Asp Gly Gly Pro Gly Ala Phe Ser Arg Ala Trp Val Leu Gin lie 
15 10 15 

Pro Trp Leu Leu Leu Ser Gly Gly Asn Phe Ala Leu Cys Glu Pro Arg 
20 25 30 

Pro Cys Pro Ser Ala Gly His Pro Trp Gin Glu Ala Gly Leu Pro Ser 
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35 40 45 



Ser Pro Xaa 
50 



<210> 155 

<211> 67 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (55) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<220> 

<221> SITE 
<222> (67) 

<223> Xaa equals stop translation 
<400> 155 

Met Pro Phe Leu Ser Val Trp Phe Phe Asn Leu Gly Leu lie Phe Gly 
1 5 10 15 

Val Glu Ser Phe Val Leu Arg Ala Val Leu Phe lie Ala Gly Cys Ser 
20 25 30 

Ala Thr Ser Gin Met Glu Ala Ala Ser Pro Tyr Pro Ala Val Thr Lys 
35 40 45 

Arg Lys Lys Asn Val Ser Xaa His Cys Gin lie Ser Ser Gly Gly Ala 
50 55 60 

Pro Gly Xaa 
65 



<210> 156 
<211> 49 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (49) 

<223> Xaa equals stop translation 
<400> 156 

Met Leu Leu Lys Arg Asn Leu Leu lie Leu lie Leu Phe Leu Val Thr 
15 10 15 

Cys Phe Asn Phe Val Ser Phe Phe Phe Phe Pro Trp Lys Leu Leu Gly 
20 25 30 
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Ser Pro Phe Tyr Pro Cys Ser Leu Arg Ser Asp Asn Asp Gly Cys Val 
35 40 45 

Xaa 



<210> 157 
<211> 61 
<212> PRT 

<213> Homo sapiens 
<400> 157 

Met Gly Ser Phe Leu His Pro Gin Trp His Leu Leu lie Thr Phe Cys 
15 10 15 

Ala Val Leu Gly Lys Gly Leu His Ser Asp Pro Ser Arg Pro Phe Glu 
20 25 30 

His Gly Gly Ala Leu Gly Lys Val Pro Arg Gly Arg Ser Thr Leu Leu 
35 40 45 

Ser Lys Glu Val Leu Leu Lys Lys Lys Lys Lys Lys Arg 
50 55 60 



<210> 158 
<211> 118 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (113) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<220> 

<221> SITE 
<222> (118) 

<223> Xaa equals stop translation 
<400> 158 

Met Leu Leu Trp Trp Gin Cys Leu Cys Cys His Ala Val Leu Glu Pro 
15 10 15 

Ala Ala Thr Ala Met Pro Glu Asp Ala Ala Pro Ser Ser Leu Pro Val 
20 25 30 

Pro Pro Asn Met Thr Ser Ser Arg Phe His Tyr Phe Trp Thr Leu Leu 
35 40 45 

Gin lie Lys Leu Thr Gin Phe Tyr Ser Lys Pro Arg Ser Leu Ser Ala 
50 55 60 

Thr Pro Glu Lys Asn lie Gly Leu Gin Glu Pro Glu Arg Arg Glu Arg 
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65 70 75 80 

Phe Thr Gly Glu Ser Cys Arg Trp Glu Leu Lys Ala Lys Ser Cys Leu 
85 90 95 

Cys Pro Thr Arg Asn Ser Leu Gly Cys Thr Gin Cys His Cys Asp Gly 
100 105 110 

Xaa Lys lie Cys Asn Xaa 
115 



<210> 159 
<211> 151 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (151) 

<223> Xaa equals stop translation 
<400> 159 

Met Leu Ala Val Leu Ala Phe Pro Val Gly Val Phe Val Val Ala Val 
15 10 15 

Phe Trp lie lie Tyr Ala Tyr Asp Arg Glu Met lie Tyr Pro Lys Leu 
20 25 30 

Leu Asp Asn Phe lie Pro Gly Trp Leu Asn His Gly Met His Thr Thr 
35 40 45 

Val Leu Pro Phe lie Leu lie Glu Met Arg Thr Ser His His Gin Tyr 
50 55 60 

Pro Ser Arg Ser Ser Gly Leu Thr Ala lie Cys Thr Phe Ser Val Gly 
65 70 75 80 

Tyr lie Leu Trp Val Cys Trp Val His His Val Thr Gly Met Trp Val 
85 90 95 

Tyr Pro Phe Leu Glu His lie Gly Pro Gly Ala Arg lie lie Phe Phe 
100 105 110 

Gly Ser Thr Thr lie Leu Met Asn Phe Leu Tyr Leu Leu Gly Glu Val 
115 120 125 

Leu Asn Asn Tyr lie Trp Asp Thr Gin Lys Ser Met Glu Glu Glu Lys 
130 135 140 

Glu Lys Pro Lys Leu Glu Xaa 
145 150 



<210> 160 
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<211> 92 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (92) 

<223> Xaa equals stop translation 
<400> 160 

Met Gly Asp Lys Leu Gly Met Ala Arg Ala Pro Ser Val Ala Leu Ala 
15 10 15 

Gin Leu Trp Leu lie Cys Leu Cys Pro Glu Ser Leu Ala Ser Phe Val 
20 25 30 

Gin Ala Val Pro Trp Lys Val Leu Gin Pro Ser Ser Asn Arg Ser Thr 
35 40 45 

Asp Cys Ser Pro His Met Arg Pro Thr Cys Glu Thr Leu Gly Ser Arg 
50 55 60 

Lys Ala Gin Asp Leu Val Leu Asp Thr Met Cys Leu Ser Thr Asp Asp 
65 70 75 80 

Cys Gin Gly Leu lie Cys Arg Gly His Arg Ser Xaa 
85 90 



<210> 161 

<211> 42 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (42) 

<223> Xaa equals stop translation 
<400> 161 

Met Gin Val Ala Cys Val Met Lys Val Ser Ala Gin Trp Val Cys Phe 
15 10 15 

Phe Val Val Phe Ser Pro Leu Cys Ser Ser Val Lys Cys Ala Ser Ser 
20 25 30 

Gly Gin Asn Arg Gly Arg Gly Asp Gin Xaa 
35 40 



<210> 162 

<211> 78 

<212> PRT 

<213> Homo sapiens 
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<220> 

<221> SITE 
<222> (78) 

<223> Xaa equals stop translation 
<400> 162 

Met Met Leu Gin lie lie His Leu Asn Thr Leu lie Lys Phe Phe Gin 
15 10 15 



Cys Leu Lys Leu Phe Leu His Gly 
20 

Leu Ala Tyr Lys Phe Ser Gin Phe 

35 40 

Lys Lys Val His His Leu Leu Ser 
50 55 

Ser Gin Ala Asp Asn Ser Ser Trp 
65 70 



Thr Ala Gly Ser Gly Gin Lys Cys 
25 30 

Pro Ser lie lie Pro Ala Ala His 
45 

Pro Lys Cys Leu Pro Thr Glu Cys 
60 

Asp Ser Ala Val Trp Xaa 
75 



<210> 163 
<211> 55 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (55) 

<223> Xaa equals stop translation 



<400> 163 

Met Lys Arg Leu Trp Cys Leu Ser 
1 5 

Pro Ser Val Leu Ser Ser Val Phe 
20 

His Trp Thr Cys Ser Gin Val Ser 
35 40 

Phe lie Leu Phe Ser Gly Xaa 
50 55 



Trp Val Pro Gly Leu Gin Gly Ser 
10 15 

Phe Ser Val Phe Lys Pro Gin Leu 
25 30 

Ser His Trp His Pro Pro Cys Leu 
45 



<210> 164 
<211> 90 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (90) 

<223> Xaa equals stop translation 
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<400> 164 

Met Lys Phe Leu Leu Ala Ala Leu 
1 5 

Ser Ser Gin Tyr lie Lys Trp lie 
20 

Ser Glu Phe Ser Phe Val Leu Gly 
35 40 

lie Ser Arg Glu Val Tyr Leu Leu 
50 55 

Leu Leu Leu Ala Pro Val Leu Trp 
65 70 

Pro Arg Pro Glu Arg Arg Ser Ser 
85 



Val Leu Ser Leu lie Leu Pro Arg 
10 15 

Val Ser Ala Gly Leu Ala Gin Val 
25 30 

Ser Arg Ala Arg Arg Ala Gly Val 
45 

lie Leu Ser Val Thr Thr Leu Ser 
60 

Arg Ala Ala lie Thr Arg Cys Val 
75 80 

Leu Xaa 
90 



<210> 165 

<211> 45 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (45) 

<223> Xaa equals stop translation 
<400> 165 

Met Phe Val Trp His Leu Lys Val Met Val Met Phe lie lie Leu Tyr 
15 10 15 

Phe Ala Tyr Cys Glu Ser Asn Phe His Ser Val Leu Ser Val Ser Lys 
20 25 30 

Pro Leu Leu Lys lie Leu Phe Leu Pro Arg Asn Leu Xaa 
35 40 45 



<210> 166 

<211> 45 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (45) 

<223> Xaa ecjuals stop translation 
<400> 166 

Met Thr Pro Gly Cys Ser Val Pro Phe Leu Leu Cys Trp Leu Phe Ala 
15 10 15 
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Leu Met Met Gin Glu Lys Trp Gly 
20 

His Tyr Ser Arg Gin Trp His Gin 
35 40 



Gly Val Lys Ser Leu Val Ser Tyr 
25 30 

Thr Val Val Val Xaa 
45 



<210> 167 

<211> 66 

<212> PRT 

<213> Homo sapiens 



<220> 

<221> SITE 
<222> (66) 

<223> Xaa equals stop translation 



<400> 167 

Met Ser lie Ala Leu Arg lie Asn Arg Leu His Phe Trp Val Leu Leu 
15 10 15 

Phe Phe Phe Phe Phe Ala Gin Leu Ser Leu Ser Val Asp Leu His Gly 
20 25 30 



Thr Ser Tyr Ser 
35 

Leu Glu Lys Leu 
50 

lie Xaa 
65 



Leu Lys Ser Leu 
40 

Asp Val Gly Pro 
55 



Ser Tyr Leu Thr 



Tyr Glu Lys lie 
60 



lie Phe Leu Asp 
45 

lie Arg Asn Gin 



<210> 168 

<211> 62 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (62) 

<22 3> Xaa equals stop translation 
<400> 168 

Met Gin Leu Thr Leu Gly Gly Ala Ala Val Gly Ala Gly Ala Val Leu 
15 10 15 

Ala Ala Ser Leu Leu Trp Ala Cys Ala Val Gly Leu Tyr Met Gly Gin 
20 25 30 

Leu Glu Leu Asp Val Glu Leu Val Pro Glu Asp Asp Gly Thr Ala Ser 
35 40 45 
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Ala Glu Gly Pro Asp Glu Ala Gly Arg Pro Pro Pro Glu Xaa 
50 55 60 



<210> 169 

<211> 47 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (47) 

<223> Xaa equals stop translation 
<400> 169 

Met His Thr Ala Lys Met Ser Leu Leu Asn Ser Val Cys Leu Leu Val 
15 10 15 

Leu Ser lie Trp Tyr Val Val Lys Phe Pro Met Met Arg Asp Ser Thr 
20 25 30 

lie Asn Val Pro Tyr Leu Leu Arg Leu Lys Ala lie Thr Thr Xaa 
35 40 45 



<210> 170 

<211> 106 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (69) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<220> 

<221> SITE 
<222> (106) 

<223> Xaa equals stop translation 
<400> 170 

Met Ser Gly Leu Ala Ala Ala Ala His Val Phe Arg Val Cys Leu Phe 
15 10 15 

Pro Leu Ser Trp Gly Ser . Ser Lys Thr Thr Phe lie His Gly Leu Ser 
20 25 30 

Ser Tyr lie Ala Thr Pro Val Leu Asn Ser lie Phe Ser Ser Trp Lys 
35 40 45 

Ser Arg Arg Lys Asp Thr Trp Thr Cys Leu Leu His Arg Leu Ser Ala 
50 55 60 

Phe Pro lie Ser Xaa Arg Arg Arg Asn Phe Ala Leu Phe Ser His Ser 
65 70 75 80 
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Cys Val Cys lie Arg Ser Ser Ser Asp Asp Val Gly Pro Thr Met Tyr 
85 90 95 

Ser Phe Ser Val Pro Cys Arg Val Lys Xaa 
100 105 



<210> 171 
<211> 45 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (45) 

<223> Xaa equals stop translation 
<400> 171 

Met His Leu Leu Thr Leu Phe Ser 
1 5 

Ser Thr Pro Leu Ser Phe Cys Asp 
20 

Leu Glu Phe Pro Val Glu Thr Ser 
35 40 



er Gly Leu lie Phe Leu Gly Cys 
10 15 

Cys Leu Pro lie Leu Leu Leu Trp 
25 30 

Gly Val Cys Ser Xaa 
45 



<210> 172 
<211> 47 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (47) 

<223> Xaa equals stop translation 
<400> 172 

Met lie Leu Lys His Tyr lie Leu Thr Phe lie Phe Leu Phe lie Phe 
15 10 15 

Leu Phe Phe Met Leu Asn lie Leu His Ser Asn Ser Asn Leu lie Asp 
20 25 30 

Leu Leu Lys Gly Asn lie Arg Phe Arg Leu Leu Asn Ser Met Xaa 
35 40 45 



<210> 173 

<211> 42 

<212> PRT 

<213> Homo sapiens 
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<220> 

<221> SITE 
<222> (42) 

<223> Xaa equals stop translation 
<400> 173 

Met Ala Thr Leu Gin lie Thr Thr Ala Met Lys lie Thr Met Met lie 
15 10 15 

Thr Met Val Met lie lie Thr Thr lie Val Glu Ala Met Lys lie Pro 
20 25 30 

Thr Thr Ala Met Met Met Ala Met Gin Xaa 
35 40 



<210> 174 
<211> 47 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (47) 

<223> Xaa equals stop translation 
<400> 174 

Met Glu Met Leu Ser Ser Lys Trp Ser Lys Arg Val Ala Ala Ser Leu 
15 10 15 

Ala His Leu lie Ser Leu Phe lie Gly Leu Leu Phe Leu Leu Leu Gly 
20 25 30 

Ser Ser Val Tyr Pro Gly Thr Glu Thr Leu Phe Pro Lys Ser Xaa 
35 40 45 



<210> 175 

<211> 41 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (41) 

<223> Xaa equals stop translation 
<400> 175 

Met Trp Pro Ser Leu Gly Arg Cys Cys Leu Phe Phe Cys Leu Leu Thr 
15 10 15 

Asn Leu Thr Ser Cys His Thr Ser Gin lie Thr Leu Cys Ser Arg Glu 
20 25 30 

Thr Cys Val Trp Ser Arg Thr Thr Xaa 
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35 40 



<210> 176 

<211> 53 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (53) 

<223> Xaa equals stop translation 
<400> 176 

Met Tyr Leu Met Ser Phe Ser lie His Phe Val Lys lie lie Cys Met 
15 10 15 

Cys Thr lie Leu Val Leu Ser Pro Pro Val Leu Leu Lys Tyr Gin Asp 
20 25 30 

Ser Thr Pro Arg Pro Leu Trp Ser Gin Cys Lys lie Pro lie Asn Tyr 
35 40 45 

Leu Lys Gly Lys Xaa 
50 



<210> 177 
<211> 250 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (250) 

<223> Xaa equals stop translation 
<400> 177 

Met Arg Gly Pro Ser Trp Ser Arg Pro Arg Pro Leu Leu Leu Leu Leu 
1 5 10 15 

Leu Leu Leu Ser Pro Trp Pro Val Trp Ala Gin Val Ser Ala Arg Ala 
20 25 30 

Ser Pro Ser Gly Ser Leu Gly Ala Pro Asp Cys Pro Glu Val Cys Thr 
35 40 45 

Cys Val Pro Gly Gly Leu Pro Ala Val Gly Thr Leu Ala Ala Arg Arg 
50 55 60 

Ala Pro Gly Pro Glu Pro Ala Pro Ala Arg Ala Ala Ala Gly Pro Gin 
65 70 75 80 

Pro Arg Pro Cys Ala Ala Ala Arg Cys Leu Arg Gly Ser Gly Arg Ala 
85 90 95 
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Thr Ala Pro Gly 
100 

Ser Leu Leu Gly 
115 

Pro Ala Gly Ser 
130 

Ala Gin Pro Leu 
145 

Gly Ala Arg Arg 



Arg Ala Gly Gly 
180 

Arg Arg Ala Ala 
195 

Ala Pro Ala Leu 
210 

Gly Arg Asp Gly 
225 

Pro Asp Cys Leu 



Pro Ala Arg Glu 



Pro Gly Arg Ala 
120 

Thr Gly Thr Arg 
135 

lie Gly Arg Gin 
150 

Ala Pro Ala Ala 
165 

Thr Arg Ala Gly 



Pro Ala Arg Gin 
200 

Arg Leu Ala Ala 
215 

Ala Leu Arg Val 
230 

Phe Arg Arg Arg 
245 



Arg Ala Ala Leu 
105 

Ala Ala Ala Gly 



Asp Phe Arg Ala 
140 

Pro Ala Gly Ala 
155 

Ala Leu Thr Gin 
170 

Ala Ala Gly Pro 
185 

Pro Leu Gly Leu 



Pro Ala Pro Ala 
220 

Ala Gly Thr Pro 
235 

Leu Xaa 
250 



Gly Ala Cys Ala 
110 

Pro Glu Arg Gin 
125 

Ala Ala Arg Ala 



Pro Gly Ala Arg 
160 

Pro Ala Gly Gin 
175 

Pro Ala Arg Ser 
190 

Arg Val Arg Ala 
205 

Ala Arg Val Arg 



Asp Ala Gin Pro 
240 



<210> 178 

<211> 148 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (148) 

<223> Xaa equals stop translation 
<400> 178 

Met Leu Ala Gly Ala Gly Arg Pro Gly Leu Pro Gin Gly Arg His Leu 
15 10 15 

Cys Trp Leu Leu Cys Ala Phe Thr Leu Lys Leu Cys Gin Ala Glu Ala 
20 25 30 

Pro Val Gin Glu Glu Lys Leu Ser Ala Ser Thr Ser Asn Leu Pro Cys 
35 40 45 

Trp Leu Val Glu Glu Phe Val Val Ala Glu Glu Cys Ser Pro Cys Ser 
50 55 60 
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Asn Phe Arg Ala Lys Thr Thr Pro 
65 70 

Glu Lys lie Thr Cys Ser Ser Ser 
85 

Arg Phe Ser Phe Glu Trp Asn Asn 
100 

Ala Val Val Cys Val Ala Leu lie 
115 120 

Gin Arg Gin Leu Asp Arg Lys Ala 
130 135 

Glu Ser lie Xaa 
145 



105 



Glu Cys Gly Pro Thr Gly Tyr Val 
75 80 

Lys Arg Asn Glu Phe Lys Ser Cys 
90 95 

Ala Tyr Phe Gly Ser Ser Lys Gly 
105 HO 

Phe Ala Cys Leu Val lie lie Arg 
125 

Leu Glu Lys Val Arg Lys Gin lie 
140 



<210> 179 
<211> 48 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (48) 

<223> Xaa equals stop translation 
<400> 179 

Met Phe Met Cys Arg Leu Leu Leu Trp Ala Thr Gly Ala Tyr Gly Phe 
1 5 10 15 

Leu Gly Asp Asp Val Glu Tyr Thr Ser Val Leu Pro His Gin Lys Gly 
20 25 30 

Lys Glu Ala Trp Val Phe lie Cys Gin Leu Pro Phe lie lie Gly Xaa 
35 40 45 



<210> 180 
<211> 57 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (56) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<220> 

<221> SITE 
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<222> (57) 

<22 3> Xaa equals stop translation 
<400> 180 

Met Leu Gin Thr Leu Leu Cys Leu Trp Gin Tyr Thr Ser Ala Gin Val 
1 5 10 15 

Leu Lys Met Leu Cys lie His Arg Gin Lys Trp Asp Asn Phe Trp Ala 
20 25 30 

Val Val Met lie Asn Leu Leu lie Arg lie Gin Arg Leu Pro Phe Ser 
35 40 45 

Leu Pro lie Ala Leu Arg Val Xaa Xaa 
50 55 



<210> 181 

<211> 49 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (49) 

<223> Xaa equals stop translation 
<400> 181 

Met Pro Ser Glu Gly Arg Leu Val Leu Leu Ser Ala Phe Cys Pro Ala 
15 10 15 

Phe Phe Pro Pro Trp Val Leu Ser Gly Ser Phe Ala Phe Ser Leu Cys 
20 25 30 

Ala Glu Ser His Leu Asn Ser Ser His Arg Arg lie Ala Val Trp Thr 
35 40 45 

Xaa 



<210> 182 

<211> 46 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (46) 

<223> Xaa equals stop translation 
<400> 182 

Met Val Gin Trp Lys Asn Trp Pro Glu Ser Leu Glu Val Trp Val Leu 
1 5 10 ' 15 
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Val Leu Ala Val Pro Leu Thr His Cys Asp Leu Gly lie Leu Cys Cys 
20 25 30 

Glu Asp lie Ser Gin Val Leu His Val Ser Gin Gin lie Xaa 
35 40 45 



<210> 183 

<211> 41 

<212> PRT 

<213> Homo sapiens 



<220> 

<221> SITE 
<222> (41) 

<223> Xaa equals stop translation 



<400> 183 
Met Ala Leu Gly 
1 

Ser Ser Val Thr 
20 

Leu His Gly Thr 
35 



Leu Cys Ser Ser 
5 

Cys Leu Ala lie 

Ser Gly Leu Gly 
40 



Gly Ala Leu Ser 
10 

Met Val Leu Met 
25 

Xaa 



Thr Leu Cys Leu 
15 

Ala Val Asp Gly 
30 



<210> 184 

<211> 80 

<212> PRT 

<213> Homo sapiens 



<220> 

<221> SITE 
<222> (80) 

<223> Xaa equals stop translation 



<400> 184 
Met Thr Leu Met 
1 

Arg Ser Lys Glu 
20 

Trp Cys Ser Pro 
35 

Cys Ala Ala Ser 
50 

Leu Gly Val Gin 
65 



Cys Leu Cys Leu 
5 

Arg Leu Ser Gly 

Ala Ser Glu Ser 
40 

Gly Ser His Pro 
55 

Leu Ala Ala Leu 
70 



Ser Val Thr Val 
10 

Thr Phe Cys Gly 
25 

Ser Ser Pro Gly 



Asp Cys Pro Leu 
60 

Gly Arg Pro Gin 
75 



Leu His Pro Leu 
15 

Tyr Ser Ser Ser 
30 

Ser Leu Leu Thr 
45 

Ser Gin Arg Leu 



Gly Leu Phe Xaa 
80 
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<210> 185 

<211> 47 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (47) 

<223> Xaa equals stop translation 
<400> 185 

Met Lys Ser Gin Cys Tyr Ser Pro Ser Tyr Phe Ala Phe Phe Cys Leu 
15 10 15 

Val Phe Phe Gin lie Thr Ser Ala Ser Ser Gin Thr Leu Arg Gly His 
20 25 30 

Val Leu Cys Arg Thr Thr Leu Arg Asp Ser Ser Ala Tyr Cys Xaa 
35 40 45 



<210> 186 
<211> 141 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (36) 

<22 3> Xaa equals any of the naturally occurring L-amino acids 
<220> 

<221> SITE 
<222> (141) 

<223> Xaa equals stop translation 
<400> 186 

Met Phe Leu Phe Gly Gly Phe Leu Met Thr Leu Phe Gly Leu Phe Val 
15 10 15 

Ser Leu Val Phe Leu Gly Gin Ala Phe Thr lie Met Leu Val Tyr Val 
20 25 30 

Trp Ser Arg Xaa Asn Pro Tyr Val Arg Met Asn Phe Phe Gly Leu Leu 
35 40 45 

Asn Phe Gin Ala Pro Phe Leu Pro Trp Val Leu Met Gly Phe Ser Leu 
50 55 60 

Leu Leu Gly Asn Ser lie lie Val Asp Leu Leu Gly lie Ala Val Gly 
65 70 75 80 
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His lie Tyr Phe Phe 
85 

lie Arg lie Leu Lys 
100 

Pro Asp Glu Asp Pro 
115 

Gly Phe Ala Trp Gly 
130 



Leu Glu Asp Val Phe Pro 
90 

Thr Pro Ser lie Leu Lys 
105 

Asn Tyr Asn Pro Leu Pro 
120 

Glu Gly Gin Arg Leu Gly 
135 



Asn Gin Pro Gly Gly 
95 

Ala lie Phe Asp Thr 
110 

Glu Glu Arg Pro Gly 
125 

Gly Xaa 
140 



<210> 187 
<211> 339 
<212> PRT 
< 2 1 3 > Homo s ap i ens 



<220> 

<221> SITE 
<222> (339) 

<223> Xaa equals stop translation 



<400> 187 
Met Arg Lys Pro 
1 

Leu Leu Pro Leu 
20 

Thr Pro Gly Ser 
35 

Leu Leu Thr Pro 
50 

Thr His Gly Cys 
65 

Asn His Gly Leu 



Ala Ser Trp Phe 
100 

Asn His Val Tyr 
115 

lie Leu Ser Pro 
130 

Ser Pro Thr Thr 
145 

Glu Arg Gin Thr 



Ala Ala Gly Phe 
5 

Ala Pro Ala Ala 



Pro Leu Ser Pro 
40 

Thr Trp Lys Ala 
55 

Arg Asn Pro Thr 
70 

Val Pro Asp Gly 
85 

Glu Ser Phe Cys 



Tyr Ala Lys Arg 
120 

Asn Thr Leu Lys 
135 

Met Thr Ser Pro 
150 

Phe Gin Pro Trp 



Leu Pro Ser Leu 
10 

Ala Gin Asp Ser 
25 

Thr Glu Tyr Glu 



Glu Thr Thr Cys 
60 

Leu Val Gin Leu 
75 

Ala Val Cys Ser 
90 

Gin Phe Thr His 
105 

Val Leu Cys Ser 



Glu lie Glu Ala 
140 

lie Ser Pro His 
155 

Pro Glu Arg Leu 



Leu Lys Val Leu 
15 

Thr Gin Ala Ser 
30 

Arg Phe Phe Ala 
45 

Arg Leu Arg Ala 



Asp Gin Tyr Glu 
80 

Asn Leu Pro Tyr 
95 

Tyr Arg Cys Ser 
110 

Gin Pro Val Ser 
125 

Ser Ala Glu Val 



Phe Thr Val Thr 
160 

Ser Asn Asn Val 
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165 170 175 

Glu Glu Leu Leu Gin Ser Ser Leu Ser Leu Gly Ser Gin Glu Gin Ala 
180 185 190 

Pro Glu His Lys Gin Glu Gin Gly Val Glu His Arg Gin Glu Pro Thr 
195 200 205 

Gin Glu His Lys Gin Glu Glu Gly Gin Lys Gin Glu Glu Gin Glu Glu 
210 215 220 

Glu Gin Glu Glu Glu Gly Lys Gin Glu Glu Gly Gin Gly Thr Lys Glu 
225 230 235 240 

Gly Arg Glu Ala Val Ser Gin Leu Gin Thr Asp Ser Glu Pro Lys Phe 
245 250 255 

His Ser Glu Ser Leu Ser Ser Asn Pro Ser Ser Phe Ala Pro Arg Val 
260 265 270 

Arg Glu Val Glu Ser Thr Pro Met lie Met Glu Asn lie Gin Glu Leu 
275 280 285 

lie Arg Ser Ala Gin Glu lie Asp Glu Met Asn Glu lie Tyr Asp Glu 
290 295 300 

Asn Ser Tyr Trp Arg Asn Gin Asn Pro Gly Ser Leu Leu Gin Leu Pro 
305 310 315 320 

His Thr Glu Pro Cys Trp Cys Cys Ala lie Arg Ser Trp Arg lie Pro 
325 330 335 

Ala Ser Xaa 



<210> 188 
<211> 66 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (66) 

<223> Xaa equals stop translation 
<400> 188 

Met Gin Arg lie Pro Thr Ser Pro Arg Gin Ala Trp Trp Trp Thr Cys 
15 10 15 

Trp Ala Met Phe Gin Gly Pro Ala Ala Gly Ser Val Gly Ala Glu Arg 
20 25 30 

Lys Gly Glu Gly Cys Leu Phe Phe Gly Gin Asp Glu Ser Ser Arg Cys 
35 40 45 
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Gly Arg Ser Trp Pro Leu Ala Asp Pro Trp Val Tyr Arg Val Leu Arg 
50 55 60 

Ser Xaa 
65 



<210> 189 
<211> 360 
<212> PRT 

<213> Homo sapiens 



<400> 189 
Met Val Pro Ala 
1 

Gly Trp Trp Gin 
20 

Val Glu Val Ala 
35 

Ala His Pro Leu 
50 

Leu His Asp Pro 
65 

Val Leu Gly Leu 



lie Pro Gly Glu 
100 

Thr Cys Gly Ala 
115 

Ser Leu Phe Ser 
130 

Glu Glu Tyr Tyr 
145 

Thr Glu Asp Ser 



Cys Glu Glu Arg 
180 

Leu Asn Met Ser 
195 

Asp Cys Thr Leu 
210 



Ala Gly Arg Arg 
5 

Val Leu Leu Trp 



Glu Glu Ser Gly 
40 

Gin Val Gly Ala 
55 

Met Gly Gin Asp 
70 

Asp Thr Gin Gly 
85 

Ala Glu Asp Lys 



Gly Gly Ala Glu 
120 

Leu Asp Gly Ala 
135 

Thr Glu Pro Glu 
150 

Asn Asn Thr Glu 
165 

Asn lie Thr Gly 



Gin Asp Leu Met 
200 

Val Leu Phe Tyr 
215 



Pro Pro Arg Val 
10 

Val Leu Gly Leu 
25 

Arg Leu Trp Ser 



Val Tyr Leu Gly 
60 

Arg Ala Ala Glu 
75 

Asp His Met Val 
90 

Val Ser Ser Glu 
105 

Asp Ser Arg Cys 



Gly Ala His Phe 
140 

Val Ala Glu Ser 
155 

Ser Leu Lys Ser 
170 

Leu Glu Asn Phe 
185 

Asp Phe Leu Asn 



Thr Pro Trp Cys 
220 



Met Arg Leu Leu 
15 

Pro Val Arg Gly 
30 

Glu Glu Gin Pro 
45 

Glu Glu Glu Leu 



Glu Ala Asn Ala 
80 

Met Leu Ser Val 
95 

Pro Ser Gly Val 
110 

Asn Val Arg Glu 
125 

Pro Asp Arg Glu 



Asp Ala Ala Pro 
160 

Pro Lys Val Asn 
175 

Thr Leu Lys lie 
190 

Pro Asn Gly Ser 
205 

Arg Phe Ser Ala 



BNSDOCID: <WO 9947540A1_I_> 



WO 99/47540 



PCT/US99/05804 



112 



Ser Leu Ala Pro 
225 

His Phe Leu Ala 



Phe Gly Thr Val 
260 

Pro Met Ala Arg 
275 

lie Phe lie Phe 
290 

Val Thr Gin Ala 
305 

Ser Val Asp Trp 



lie Met Tyr Ala 
340 

Gly Gin Glu Gin 

355 



His Phe Asn Ser 
230 

Leu Asp Ala Ser 
245 

Ala Val Pro Asn 



Phe Asn His Thr 
280 

Asn Gin Thr Gly 
295 

Asp Gin lie Gly 
310 

Leu Leu Val Phe 
325 

Thr lie Arg Thr 



Glu His Val Glu 
360 



Leu Pro Arg Ala 
235 

Gin His Ser Ser 
250 

lie Leu Leu Phe 
265 

Asp Arg Thr Leu 



lie Glu Ala Lys 
300 

Pro Leu Pro Ser 
315 

Ser Leu Phe Phe 
330 

Glu Ser lie Arg 
345 



Phe Pro Ala Leu 
240 

Leu Ser Thr Arg 
255 

Gin Gly Ala Lys 
270 

Glu Thr Leu Lys 
285 

Lys Asn Val Val 



Thr Leu lie Lys 
320 

Leu lie Ser Phe 
335 

Trp Leu lie Pro 
350 



<210> 190 
<211> 160 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (160) 

<223> Xaa equals stop translation 
<400> 190 

Met Leu Leu Leu Leu lie Phe Trp I 
1 5 



Asn lie Met Val Tyr lie Ser lie 
20 

Val Pro Ser Thr Lys Gly lie Gly 

35 40 

Asn Asn Pro Ser Ser Gin Arg Ala 
50 55 



le Ala Pro Ala His Gly Pro Thr 
10 15 

Cys Ser Leu Leu Gly Ser Phe Thr 
25 30 

Leu Ala Ala Gin Asp lie Leu His 
45 

Leu Cys Leu Cys Leu Val Leu Leu 
60 



Ala Val Leu Gly Cys Ser lie lie Val Gin Phe Arg Tyr lie Asn Lys 
65 70 75 80 



BNSDOCID: <WO 9947540 A 1 _L> 



WO 99/47540 



113 



PCT/US99/05804 



Ala Leu Glu Cys 



Val Phe Thr Thr 
100 

Trp Ser Asn Val 
115 

Thr Thr Val Ser 
130 

Asn Phe Asn Leu 
145 



Phe Asp Ser Ser 
85 

Leu Val Leu Leu 



Gly Leu Val Asp 
120 

Val Gly lie Val 
135 

Gly Glu Met Asn 
150 



Val Phe Gly Ala 
90 

Ala Ser Ala lie 
105 

Phe Leu Gly Met 



Leu lie Gin Val 
140 

Lys Ser Asn Met 
155 



lie Tyr Tyr Val 
95 

Leu Phe Arg Glu 
110 

Ala Cys Gly Phe 
125 

Phe Lys Glu Phe 



Lys Thr Asp Xaa 
160 



<210> 191 

<211> 101 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (92) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<220> 

<221> SITE 
<222> (96) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<220> 

<221> SITE 
<222> (101) 

<223> Xaa equals stop translation 
<400> 191 

Met Phe Val Ala Val Phe Tyr Trp Val Leu Thr Val Phe Phe Leu lie 
15 10 15 

lie Tyr lie Thr Met Thr Tyr Thr Arg lie Pro Gin Val Pro Trp Thr 
20 25 30 

Thr Val Gly Leu Cys Phe Asn Gly Ser Ala Phe Val Leu Tyr Leu Ser 
35 40 45 

Ala Ala Val Val Asp Ala Ser Ser Val Ser Pro Glu Lys Asp Ser His 
50 55 60 

Asn Phe Asn Ser Trp Ala Ala Ser Ser Phe Phe Ala Phe Leu Val Thr 
65 70 75 80 
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lie Cys Tyr Ala Gly Asn Thr Tyr Phe Ser Phe Xaa Ala Trp Arg Xaa 
85 90 95 

Arg Thr lie Gin Xaa 
100 



<210> 192 

<211> 43 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (43) 

<223> Xaa equals stop translation 
<400> 192 

Met Phe Lys Leu Gin Leu Asp Leu Leu Thr Ala Val Asn Leu Val Tyr 
15 10 15 

Phe Ser Phe Leu Trp Val Val Ser Val Ala Asn Lys Met Asp Val Ser 
20 25 30 

Val Phe Glu Leu Val Asn Ser Asp Cys Phe Xaa 
35 40 



<210> 193 

<211> 62 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (62) 

<223> Xaa equals stop translation 
<400> 193 

Met Ser Val Cys Val Phe Leu Asp Phe Arg Leu lie Phe Trp Ser Phe 
15 10 15 

Cys Pro Cys Ser Ala Ser Pro Ser Arg His Phe Ala Ser Ser Ser Arg 
20 25 30 

Gly Gly Gly Gly Gly Ser Arg Asn Trp Val Gly Ala Gly Ala Ser Leu 
35 40 45 

Ala Ala Ser Leu Ala Leu Tyr Ala Leu Ser Pro Arg Arg Xaa 
50 55 60 



<210> 194 
<211> 53 
<212> PRT 
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<213> Homo sapiens 
<220> 

<221> SITE 
<222> (53) 

<223> Xaa equals stop translation 
<400> 194 

Met Gin Ala Gin lie Ser Ser Pro Arg Trp Thr Ser Trp Phe Ser Leu 
15 10 15 

Thr Ala Val Thr Leu Ala Phe Pro Ser Leu lie Pro Tyr Pro Ser Cys 
20 25 30 

Gly lie Pro Val Leu Thr Gin Asp Ala Lys Trp Pro Ser Asp Tyr Thr 
35 40 45 

Ser Pro Asp Ser Xaa 
50 



<210> 195 
<211> 186 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (114) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<220> 

<221> SITE 
<222> (186) 

<223> Xaa equals stop translation 
<400> 195 

Met Thr Leu Leu Asn Leu Leu Leu Gin Thr lie Phe Tyr Gly Val Thr 
15 10 15 

Cys Leu Asp Asp Val Leu Lys Arg Thr Lys Gly Gly Lys Asp lie Lys 
20 25 30 

Phe Leu Thr Ala Phe Arg Asp Leu Leu Phe Thr Thr Leu Ala Phe Pro 
35 40 45 

Val Ser Thr Phe Val Phe Leu Ala Phe Trp lie Leu Phe Leu Tyr Asn 
50 55 60 

Arg Asp Leu lie Tyr Pro Lys Val Leu Asp Thr Val lie Pro Val Trp 
65 70 75 80 

Leu Asn His Ala Met His Thr Phe lie Phe Pro He Thr Leu Ala Glu 
85 90 95 
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Val Val Leu Arg Pro His Ser Tyr Pro Ser Lys Lys Thr Gly Leu Thr 
100 105 110 

Leu Xaa Ala Ala Ala Ser lie Ala Tyr lie Ser Arg lie Leu Trp Leu 
115 120 125 

Tyr Phe Glu Thr Gly Thr Trp Val Tyr Pro Val Phe Ala Lys Leu Ser 
130 135 140 

Leu Leu Gly Leu Ala Ala Phe Phe Ser Leu Ser Tyr Val Phe lie Ala- 
145 150 155 160 

Ser lie Tyr Leu Leu Gly Glu Lys Leu Asn His Trp Lys Trp Gly Asp 
165 170 175 

Met Arg Gin Pro Arg Lys Lys Arg Lys Xaa 
180 185 



<210> 196 

<211> 77 

<212> PRT 

< 2 1 3 > Homo s api ens 

<220> 

<221> SITE 
<222> (77) 

<223> Xaa equals stop translation 
<400> 196 

Met Lys Asn Ala Thr Leu Leu Arg Met Val Leu Phe Val lie Asn Leu 
15 10 15 

Gin Asn Leu Lys Ser Cys Pro Val Leu His lie His Gin Asp Val Gin 
20 25 30 

Gin Gin Lys Arg Met Gly His Gly Gly Ser Ser Thr Arg Val Thr Val 
35 40 45 

Thr Ser Leu He Arg His Cys Thr Val Phe Gin Arg Pro Lys Asn Cys 
50 55 60 

Val Gin Asn Met He Thr Leu Gin Leu Ser Phe Pro Xaa 
65 70 75 



<210> 197 
<211> 175 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> SITE 
<222> (175) 

<223> Xaa equals stop translation 
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<400> 197 

Met Phe Val Pro Ser 
1 5 

Leu Leu Gin Met Thr 
20 

Phe Glu Thr lie Leu 
35 

Arg Lys Arg His lie 
50 

Met Leu Leu Pro Ser 
65 

Leu His Pro Arg Pro 
85 

Gly Gly Glu Tyr Asp 
100 

lie Ser Ser Leu lie 
115 

Pro Leu Arg Pro Arg 
130 

Pro He Leu Pro Gly 
145 

Pro Ser Arg Gly Arg 
165 



Cys Leu Cys Leu Arg Phe 
10 

His Ser Cys Gly Gly Phe 
25 

Ser Glu Phe Lys Thr Gin 
40 

Gin Arg Lys Glu Ser Pro 
55 

Ser Thr His Thr He Pro 
70 75 

Phe Pro Ser Ser Arg Leu 
90 

Gin Arg Pro Thr Leu Pro 
105 

Pro Gly Pro Gly Glu Thr 
120 

Phe Asp Pro Val Gly Pro 
135 

Arg Gly Gly Pro Asn Asp 
150 155 

Pro Thr Asp Gly Arg Leu 
170 



Val Val Thr Ser Leu 
15 

Tyr He Cys Val He 
30 

He Gly Arg Leu Tyr 
45 

Lys Gly Arg Phe Val 
60 

Phe Tyr Pro Asn Pro 
80 

Pro Pro Gly He He 
95 

Tyr Val Gly Asp Pro 
110 

Pro Ser Gin Phe Pro 
125 

Leu Pro Gly Pro Asn 
140 

Arg Phe Pro Phe Arg 
160 

Ser Phe Met Xaa 
175 



<210> 198 
<211> 51 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (51) 

<223> Xaa equals stop translation 
<400> 198 

Met Gly Leu Lys Arg Lys Gin Gly Phe Val Phe Leu Phe Leu Leu Leu 
15 10 15 

Lys. Ser Thr Val Ala Ser Trp Leu Leu Ser Gly Val Gly Arg He Trp 
20 25 30 

Gly Leu Val His Phe Val Lys Val Asn His Val Cys Leu Asn Asn Arg 
35 40 45 
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Gly Val Xaa 
50 



<210> 199 
<211> 190 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> SITE 
<222> (190) 

<223> Xaa equals stop translation 
<400> 199 

Met Gly Pro Val Arg Leu Gly lie Leu Leu Phe Leu Phe Leu Ala Val 
15 10 15 

His Glu Ala Trp Ala Gly Met Leu Lys Glu Glu Asp Asp Asp Thr Glu 
20 25 30 

Arg Leu Pro Ser Lys Cys Glu Val Cys Lys Leu Leu Ser Thr Glu Leu 
35 40 45 

Gin Ala Glu Leu Ser Arg Thr Gly Arg Ser Arg Glu Val Leu Glu Leu 
50 55 60 

Gly Gin Val Leu Asp Thr Gly Lys Arg Lys Arg His Val Pro Tyr Ser 
65 70 75 80 

Val Ser Glu Thr Arg Leu Glu Glu Ala Leu Glu Asn Leu Cys Glu Arg 
85 90 95 

lie Leu Asp Tyr Ser Val His Ala Glu Arg Lys Gly Ser Leu Arg Tyr 
100 105 110 

Ala Lys Gly Gin Ser Gin Thr Met Ala Thr Leu Lys Gly Leu Val Gin 
115 120 125 

Lys Gly Val Lys Val Asp Leu Gly lie Pro Leu Glu Leu Trp Asp Glu 
130 135 140 

Pro Ser Val Glu Val Thr Tyr Leu Lys Lys Gin Cys Glu Thr Met Leu 
145 150 155 160 

Glu Glu Glu Glu Glu Glu Glu Glu Glu Glu Gly Gly Asp Lys Met Thr 
165 170 175 

Lys Thr Gly Ser His Pro Lys Leu Asp Arg Glu Asp Leu Xaa 
180 185 190 



<210> 200 
<211> 80 
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<212> PRT 

<2 1 3 > Homo sapiens 
<220> 

<221> SITE 
<222> (80) 

<223> Xaa equals stop translation 
<400> 200 

Met Asn Tyr Ser Arg Ser Pro Trp Ala Ala Val Met Glu Pro Leu Thr 
15 10 15 



Leu Leu Phe Leu His Leu Ser Cys 
20 

Gly Trp Asp Ser Glu Cys Leu Val 

35 40 

Leu Arg Met Gin Ala Leu Leu Cys 

50 55 



Leu Leu Ser Leu Cys Glu Ala Val 
25 30 

Cys Ser Leu Gly Glu Glu Glu Phe 
45 

Gly Cys Arg Leu His Leu Gly Gly 
60 



Val Leu Tyr Val Cys Thr Leu Gly Thr Ala Cys lie Trp Lys lie Xaa 
65 70 75 80 



<210> 201 
<211> 106 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (106) 

<223> Xaa equals stop translation 
<400> 201 

Met Asn Leu Gly Val Ser Met Leu Arg lie Leu Phe Leu Leu Asp Val 
15 10 15 

Gly Gly Ala Gin Val Leu Ala Thr Gly Lys Thr Pro Gly Ala Glu lie 
20 25 30 

Asp Phe Lys Tyr Ala Leu lie Gly Thr Ala Val Gly Val Ala lie Ser 
35 40 45 

Ala Gly Phe Leu Ala Leu Lys lie Cys Met lie Arg Arg His Leu Phe 
50 55 60 

Asp Asp Asp Ser Ser Asp Leu Lys Ser Thr Pro Gly Gly Leu Ser Asp 
65 70 75 80 

Thr lie Pro Leu Lys Lys Arg Ala Pro Arg Arg Asn His Asn Phe Ser 
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85 90 95 



Lys Arg Asp Ala Gin Val lie Glu Leu Xaa 
100 105 



<210> 202 

<211> 80 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (80) 

<223> Xaa equals stop translation 
<400> 202 

Met Ala Cys Leu Gly Gly Leu Leu Gly 
1 5 



lie Ser Cys Leu Ser Pro Glu Met 
20 

Val Arg Asn Tyr Leu Gin Lys Pro 
35 40 

Pro Pro Leu lie Asn Leu Trp Glu 
50 55 

Leu Lys Val Lys Ala Thr Val lie 
65 70 



lie lie Gly Val lie Cys Leu 
10 15 

Asn Cys Asp Gly Gly His Ser Tyr 
25 30 

Thr Phe Ala Leu Gly Glu Leu Tyr 
45 

Ala Gly Lys Glu Lys Ser Thr Ser 
60 

Gly Leu Pro Thr Asn Met Ser Xaa 

75 80 



<210> 203 

<211> 58 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (58) 

<223> Xaa equals stop translation 
<400> 203 

Met Gly Leu Lys Leu Leu Gin Lys Pro Gly Ser Leu Lys Thr Leu lie 
15 10 15 

Ala lie lie Leu Val Met Tyr lie Phe Met Thr lie Ser Val lie Ala 
20 25 30 

Gly Thr Gly Lys Phe Ser Gin Lys Leu Asp Leu His Leu Asn Met Asp 
35 40 45 



BNSDOCID: <WO 9947540A1 I > 



WO 99/47540 



PCT/US99/05804 



121 



He Ser Pro Gly Arg Pro Ser Val His Xaa 
50 55 



<210> 204 
<211> 161 
<212> PRT 
<213> Homo sapiens 

<400> 204 

Met Asp Phe Leu Asn Pro Asn Gly Ser Asp Cys Thr Leu Val Leu Phe 
1 5 10 15 

Tyr Thr Pro Trp Cys Arg Phe Ser Ala Ser Leu Ala Pro His Phe Asn 
20 25 30 

Ser Leu Pro Arg Ala Phe Pro Ala Leu His Phe Leu Ala Leu Asp Ala 
35 40 45 

Ser Gin His Ser Ser Leu Ser Thr Arg Phe Gly Thr Val Ala Val Pro 
50 55 60 

Asn He Leu Leu Phe Gin Gly Ala Lys Pro Met Ala Arg Phe Asn His 
65 70 75 80 

Thr Asp Arg Thr Leu Glu Thr Leu Lys He Phe lie Phe Asn Gin Thr 
85 90 95 

Gly He Glu Ala Lys Lys Asn Val Val Val Thr Gin Ala Asp Gin He 
100 105 110 

Gly Pro Leu Pro Ser Thr Leu He Lys Ser Val Asp Trp Leu Leu Val 
115 120 125 

Phe Ser Leu Phe Phe Leu He Ser Phe He Met Tyr Ala Thr He Arg 
130 135 140 

Thr Glu Ser He Arg Trp Leu He Pro Gly Gin Glu Gin Glu His Val 
145 150 155 160 

Glu 



<210> 205 
<211> 137 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (10) 

<223> Xaa equals any of the naturally occurring L-amino acids 
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<400> 205 
lie Pro Glu Asn 
1 

Thr Ser Arg Thr 
20 

Val Ser Ser Ala 
35 

Ser Thr Ser Cys 
50 

Cys Thr Pro Ser 
65 

Leu Glu Leu Pro 



Thr Tyr Arg Cys 
100 

Ala Tyr Glu Met 
115 

Lys Phe Leu Leu 
130 



Arg Arg Pro Ala 
5 

Thr Thr Arg Arg 



Ser Val Ser Ser 
40 

Cys Arg Ser Ser 
55 

Ala Ser Thr Glu 
70 

Val Val His Thr 
85 

Ser Ala Gly Asp 



Gly Glu Glu Met 
120 

Phe His Phe Tyr 
135 



Ser Xaa Cys Thr 
10 

Pro Pro Trp Gly 
25 

Thr Arg Lys Thr 



Arg Arg Arg Val 
60 

Pro Ser Ala Arg 
75 

Phe Ser Phe Leu 
90 

Gly Ser He Thr 
105 

Pro Lys Arg Gin 



Leu 



Trp Ser Met Trp 
15 

Arg Phe Ser Ser 
30 

Trp Arg Thr Arg 
45 

Ala Ala Pro Phe 



Met Glu Pro Pro 
80 

Thr Phe Val Phe 
95 

Gin He Asn Cys 
110 

Met Lys Ala He 
125 



<210> 206 

<211> 41 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (10) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<400> 206 

lie Pro Glu Asn Arg Arg Pro Ala Ser Xaa Cys Thr Trp Ser Met Trp 
15 10 15 

Thr Ser Arg Thr Thr Thr Arg Arg Pro Pro Trp Gly Arg Phe Ser Ser 
20 25 30 

Val Ser Ser Ala Ser Val Ser Ser Thr 
35 40 



<210> 207 

<211> 43 

<212> PRT 

<213> Homo sapiens 
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<400> 207 

Arg Lys Thr Trp Arg Thr Arg Ser Thr Ser Cys Cys Arg Ser Ser Arg 
15 10 15 

Arg Arg Val Ala Ala Pro Phe Cys Thr Pro Ser Ala Ser Thr Glu Pro 
20 25 30 

Ser Ala Arg Met Glu Pro Pro Leu Glu Leu Pro 
35 40 



<210> 208 
<211> 53 
<212> PRT 

<213> Homo sapiens 
<400> 208 

Val Val His Thr Phe Ser Phe Leu Thr Phe Val Phe Thr Tyr Arg Cys 
1 5 10 15 

Ser Ala Gly Asp Gly Ser lie Thr Gin lie Asn Cys Ala Tyr Glu Met 
20 25 30 

Gly Glu Glu Met Pro Lys Arg Gin Met Lys Ala lie Lys Phe Leu Leu 
35 40 45 

Phe His Phe Tyr Leu 
50 



<210> 209 
<211> 223 
<212> PRT 

<213> Homo sapiens 
<400> 209 

His Pro Ser lie lie lie Trp Ser Gly Asn Asn Glu Asn Glu Glu Ala 
15 10 15 

Leu Met Met Asn Trp Tyr His lie Ser Phe Thr Asp Arg Pro lie Tyr 
20 25 30 

lie Lys Asp Tyr Val Thr Leu Tyr Val Lys Asn lie Arg Glu Leu Val 
35 40 45 

Leu Ala Gly Asp Lys Ser Arg Pro Phe lie Thr Ser Ser Pro Thr Asn 
50 55 60 

Gly Ala Glu Thr Val Ala Glu Ala Trp Val Ser Gin Asn Pro Asn Ser 
65 70 75 80 

Asn Tyr Phe Gly Asp Val His Phe Tyr Asp Tyr lie Ser Asp Cys Trp 
85 90 95 

Asn Trp Lys Val Phe Pro Lys Ala Arg Phe Ala Ser Glu Tyr Gly Tyr 
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100 

Gin Ser Trp Pro Ser 
115 

Asp Trp Ser Phe Asn 
130 

Gly Gly Asn Lys Gin 
145 

Pro Gin Ser Thr Asp 
165 

Thr Gin Val Met Gin 
180 

Arg Arg Ser Arg Ser 
195 

Ala Leu Tyr Trp Gin 
210 



105 

Phe Ser Thr Leu Glu Lys 
120 

Ser Lys Phe Ser Leu His 
135 

Met Leu Tyr- Gin Ala Gly 
150 155 

Pro Leu Arg Thr Phe Lys 
170 

Ala Gin Cys Val Lys Thr 
185 

Glu lie Val Asp Gin Gin 
200 

Leu Asn Asp lie Trp Gin 
215 



110 

Val Ser Ser Thr Glu 
125 

Arg Gin His His Glu 
140 

Leu His Phe Lys Leu 
160 

Asp Thr lie Tyr Leu 
175 

Glu Thr Glu Phe Tyr 
190 

Gly His Thr Met Gly 
205 

Ala Pro Ser Trp 
220 



<210> 210 

<211> 160 

<212> PRT 

<213> Homo sapiens 

<400> 210 

Val Arg Val His Thr Trp Ser Ser Leu Glu Pro Val Cys Ser Arg Val 
15 10 15 

Thr Glu Arg Phe Val Met Lys Gly Gly Glu Ala Val Cys Leu Tyr Glu 
20 25 30 

Glu Pro Val Ser Glu Leu Leu Arg Arg Cys Gly Asn Cys Thr Arg Glu 
35 40 45 

Ser Cys Val Val Ser Phe Tyr Leu Ser Ala Asp His Glu Leu Leu Ser 
50 55 60 

Pro Thr Asn Tyr His Phe Leu Ser Ser Pro Lys Glu Ala Val Gly Leu 
65 70 75 80 

Cys Lys Ala Gin lie Thr Ala lie lie Ser Gin Gin Gly Asp lie Phe 
85 90 95 

Val Phe Asp Leu Glu Thr Ser Ala Val Ala Pro Phe Val Trp Leu Asp 
100 105 110 

Val Gly Ser lie Pro Gly Arg Phe Ser Asp Asn Gly Phe Leu Met Thr 
115 120 125 

Glu Lys Thr Arg Thr lie Leu Phe Tyr Pro Trp Glu Pro Thr Ser Lys 
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130 135 140 



Asn Glu Leu Glu Gin Ser Phe His Val Thr Ser Leu Thr Asp lie Tyr 
145 150 155 160 



<210> 211 
<211> 171 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (102) 

<22 3> Xaa equals any of the naturally occurring L-amino acids 
<400> 211 

Pro Arg Leu Thr Pro Arg Met Lys Trp Pro Thr Ala Ala Leu Ala Ser 
15 10 15 

Arg Leu Leu Gly Trp Thr Val Leu Arg Pro Pro Tyr Pro Arg Val Pro 
20 25 30 

Ser Leu Pro Gin Val Thr Leu His Pro Thr Asp Gly Leu Met Ala Val 
35 40 45 

Leu Tyr Thr Gly Gly Glu Gly Arg Thr Leu Gly Glu Gin His Phe Phe 
50 55 60 

His Glu Thr Phe Val Thr Arg Trp Leu Leu Gly Pro Val Pro Val Arg 
65 70 75 80 

Phe Gly Ala Cys Ser Pro Leu Ser Phe Leu Ala Pro Arg Arg Gly Gin 
85 90 95 

Gly Ala Pro Ala Gly Xaa Phe Cys Ala Cys Pro Arg Pro Ala Ser Arg 
100 105 110 

Gin Leu Cys Pro Trp Pro Ala Leu Pro Gly Thr Pro Tyr Ser Asn Ser 
115 120 125 

Ala Pro Leu Cys Thr Gly Met Gly His Ser Asn Thr Pro Gin Gly Pro 
130 135 140 

Pro Ser Pro Gin Tyr Ala Leu Ser Pro Thr Glu Pro Thr Ser Leu Ser 
145 150 155 160 

Gly Asn Ser His Leu Pro Ala lie Leu Val Leu 
165 170 



<210> 212 
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<211> 41 

<212> PRT 

<213> Homo sapiens 

<400> 212 

Pro Arg Leu Thr Pro Arg Met Lys Trp Pro Thr Ala Ala Leu Ala Ser 
1 5 10 15 

Arg Leu Leu Gly Trp Thr Val Leu Arg Pro Pro Tyr Pro Arg Val Pro 
20 25 30 

Ser Leu Pro Gin Val Thr Leu His Pro 
35 40 



<210> 213 

<211> 41 

<212> PRT 

<213> Homo sapiens 



<400> 213 
Thr Asp Gly Leu 
1 

Leu Gly Glu Gin 
20 

Leu Gly Pro Val 
35 



Met Ala Val Leu 
5 

His Phe Phe His 

Pro Val Arg Phe 
40 



Tyr Thr Gly Gly 
10 

Glu Thr Phe Val 
25 

Gly 



Glu Gly Arg Thr 
15 

Thr Arg Trp Leu 
30 



<210> 214 

<211> 42 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (20) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<400> 214 

Ala Cys Ser Pro Leu Ser Phe Leu Ala Pro Arg Arg Gly Gin Gly Ala 
15 10 15 

Pro Ala Gly Xaa Phe Cys Ala Cys Pro Arg Pro Ala Ser Arg Gin Leu 
20 25 30 

Cys Pro Trp Pro Ala Leu Pro Gly Thr Pro 
35 40 



<210> 215 
<211> 47 
<212> PRT 
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<213> Homo sapiens 
<400> 215 

Tyr Ser Asn Ser Ala Pro Leu Cys 
1 5 

Pro Gin Gly Pro Pro Ser Pro Gin 
20 

Thr Ser Leu Ser Gly Asn Ser His 
35 40 



Thr Gly Met Gly His Ser Asn Thr 
10 15 

Tyr Ala Leu Ser Pro Thr Glu Pro 
25 30 

Leu Pro Ala He Leu Val Leu 
45 



<210> 216 
<211> 27 
<212> PRT 

<213> Homo sapiens 
<400> 216 

His Leu Leu Glu Val Thr Pro Cys Arg Leu Pro Val Pro Glu Phe Pro 
15 10 15 

Gly Arg Thr Pro Arg Gly Ser Arg Thr Pro Asp 
20 25 



<210> 217 

<211> 239 

<212> PRT 

<213> Homo sapiens 

<400> 217 

Met He Pro Gly Ser Asp Ser Gin Thr Ala Leu Asn Phe Gly Ser Thr 
15 10 15 

Leu Met Lys Lys Lys Ser Asp Pro Glu Gly Pro Ala Leu Leu Phe Pro 
20 25 30 

Glu Ser Glu Leu Ser He Arg He Gly Arg Ala Gly Leu Leu Ser Asp 
35 40 45 

Lys Ser Glu Asn Gly Glu Ala Tyr Gin Arg Lys Lys Ala Ala Ala Thr 
50 55 ' 60 

Gly Leu Pro Glu Gly Pro Ala Val Pro Val Pro Ser Arg Gly Asn Leu 
65 70 75 80 

Ala Gin Pro Gly Gly Ser Ser Trp Arg Arg He Ala Leu Leu He Leu 
85 90 95 

Ala He Thr He His Asn Val Pro Glu Gly Leu Ala Val Gly Val Gly 
100 105 110 

Phe Gly Ala lie Glu Lys Thr Ala Ser Ala Thr Phe Glu Ser Ala Arg 
115 120 125 
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Asn Leu Ala lie 
130 

Val Ser Leu Pro 
145 

Trp Tyr Gly Gin 



Gly Ala Phe Ala 
180 

Ala Phe Ala Ala 
195 

Pro Glu Ala Gin 
210 

lie Leu Gly Phe 
225 



Gly lie Gly lie 
135 

Leu Arg Gly Ala 
150 

Leu Ser Gly Met 
165 

Val Val Leu Ala 



Gly Ala Met Val 
200 

lie Ser Gly Asn 
215 

Val Val Met Met 
230 



Gin Asn Phe Pro 
140 

Gly Phe Ser Thr 
155 

Val Glu Pro Leu 
170 

Glu Pro lie Leu 
185 

Tyr Val Val Met 



Gly Lys Leu Ala 
220 

Ser Leu Asp Val 
235 



Glu Gly Leu Ala 



Trp Arg Ala Phe 
160 

Ala Gly Val Phe 
175 

Pro Tyr Ala Leu 
190 

Asp Asp lie lie 
205 

Ser Trp Ala Ser 



Gly Leu Gly 



<210> 218 
<211> 43 
<212> PRT 

<213> Homo sapiens 
<400> 218 

Met lie Pro Gly Ser Asp Ser Gin 
1 5 

Leu Met Lys Lys Lys Ser Asp Pro 
20 

Glu Ser Glu Leu Ser lie Arg lie 

35 40 



Thr Ala Leu Asn Phe Gly Ser Thr 
10 15 

Glu Gly Pro Ala Leu Leu Phe Pro 
25 30 

Gly Arg Ala 



<210> 219 
<211> 41 
<212> PRT 

<213> Homo sapiens 
<400> 219 

Gly Leu Leu Ser Asp Lys Ser Glu Asn Gly Glu Ala Tyr Gin Arg Lys 
1 5 10 15 

Lys Ala Ala Ala Thr Gly Leu Pro Glu Gly Pro Ala Val Pro Val Pro 
20 25 30 

Ser Arg Gly Asn Leu Ala Gin Pro Gly 
35 40 
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<210> 220 
<211> 44 
<212> PRT 

<213> Homo sapiens 
<400> 220 

Gly Ser Ser Trp Arg Arg lie Ala Leu Leu lie Leu Ala lie Thr lie 
15 10 15 

His Asn Val Pro Glu Gly Leu Ala Val Gly Val Gly Phe Gly Ala lie 
20 25 30 

Glu Lys Thr Ala Ser Ala Thr Phe Glu. Ser Ala Arg 
35 40 



<210> 221 

<211> 43 

<212> PRT 

<213> Homo sapiens 

<400> 221 

Asn Leu Ala lie Gly lie Gly He Gin Asn Phe Pro Glu Gly Leu Ala 
15 10 15 

Val Ser Leu Pro Leu Arg Gly Ala Gly Phe Ser Thr Trp Arg Ala Phe 
20 25 30 

Trp Tyr Gly Gin Leu Ser Gly Met Val Glu Pro 
35 40 



<210> 222 

<211> 43 

<212> PRT 

<213> Homo sapiens 

<400> 222 

Leu Ala Gly Val Phe Gly Ala Phe Ala Val Val Leu Ala Glu Pro He 
1 5 10 15 

Leu Pro Tyr Ala Leu Ala Phe Ala Ala Gly Ala Met Val Tyr Val Val 
20 25 30 

Met Asp Asp He He Pro Glu Ala Gin He Ser 
35 40 



<210> 223 
<211> 25 
<212> PRT 

<2 1 3 > Homo sapiens 
<400> 223 

Gly Asn Gly Lys Leu Ala Ser Trp Ala Ser He Leu Gly Phe Val Val 
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10 15 



Met Met Ser Leu Asp Val Gly Leu Gly 
20 25 



<210> 224 

<211> 11 

<212> PRT 

<213> Homo sapiens 

<400> 224 

Thr Arg Pro He Thr Tyr Val Leu Leu Ala Gly 
15 10 



<210> 225 

<211> 35 

<212> PRT 

<213> Homo sapiens 

<400> 225 

Gly Thr Ser Leu Thr Ala Pro Leu Leu Glu Phe Leu Leu Ala Leu Tyr 
15 10 15 

Phe Leu Phe Ala Asp Ala Met Gin Leu Asn Asp Lys Trp Gin Gly Leu 
20 25 30 

Cys Trp Pro 
35 



<210> 226 
<211> 30 
<212> PRT 

<213> Homo sapiens 
<400> 226 

Leu Ala Asn Phe Glx Cys Ser Asp Cys Ala Gin Thr Val Leu Phe Val 
1 5 10 15 

Leu Glx Phe Glx He Leu Val Phe Thr Tyr Glu He Pro Phe 
20 25 30 



<210> 227 

<211> 75 

<212> PRT 

<213> Homo sapiens 

<400> 227 

Gin Ala Trp His Glu Val Gly Gly Gly Val Arg Arg Cys Trp Phe Val 
15 10 15 

Leu Gly Glu Arg Arg Ala Gly Ser Leu Leu Ser Ala Ser Tyr Gly Thr 
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20 25 30 

Phe Ala Met Pro Gly Met Val Leu Phe Gly Arg Arg Trp Ala lie Ala 
35 40 45 

Ser Asp Asp Leu Val Phe Pro Gly Phe Phe Glu Leu Val Val Arg Val 
50 55 60 

Leu Trp Trp lie Gly lie Leu Thr Leu Tyr Leu 
65 70 75 



<210> 228 

<211> 125 

<212> PRT 

<213> Homo sapiens 

<400> 228 

Pro Gly Met Val Leu Phe Gly Arg Arg Trp Ala lie Ala Ser Asp Asp 
1 5 10 15 

Leu Val Phe Pro Gly Phe Phe Glu Leu Val Val Arg Val Leu Trp Trp 
20 25 30 

lie Gly lie Leu Thr Leu Tyr Leu Met His Arg Gly Lys Leu Asp Cys 
35 40 45 

Ala Gly Gly Ala Leu Leu Ser Ser Tyr Leu lie Val Leu Met lie Leu 
50 55 60 

Leu Ala. Val Val lie Cys Thr Val Ser Ala lie Met Cys Val Ser Met 
65 70 75 80 

Arg Gly Thr lie Cys Asn Pro Gly Pro Arg Lys Ser Met Ser Lys Leu 
85 90 95 

Leu Tyr lie Arg Leu Ala Leu Phe Phe Pro Glu Met Val Trp Ala Ser 
100 105 110 

Leu Gly Ala Ala Trp Val Ala Asp Gly Val Gin Cys Asp 
115 120 125 



<210> 229 
<211> 18 
<212> PRT 

<213> Homo sapiens 
<400> 229 

His Glu Arg Asn Cys Phe Pro Met Trp Leu Asn His Ser Ala Phe Pro 
15 10 15 



Pro Val 



WO 99/47540 



PCI7US99/05804 



132 



<210> 230 
<211> 132 
<212> PRT 
<213> Homo sapiens 

<400> 230 

Gly Trp Thr Arg Glu Asn Asp His Arg Ala Leu Ser Lys Ala Gly lie 
15 10 15 

Gly Ser Ala Glu lie Gin Pro Ser Asn Leu Arg Val Gly Ser Ala Lys 
20 25 30 

Asp Leu Gly Lys Pro Trp Ala Gly Lys Leu Leu Leu Leu Ser Ser Cys 
35 40 45 

Leu Leu Phe Phe Ser Leu Gly Val Leu Tyr Arg Gly Gin Met Leu Ala 
50 55 60 

Pro Pro Leu Gin Glu Asp Trp Lys Gly Gly Val Lys Asp Ser Asp Leu 
65 70 75 80 

lie Asp Asp Ser Ser Ala Ser Pro lie Pro Pro Ser Tyr Leu Glu Tyr 
85 90 95 

Lys Ala Ala Leu Tyr Pro Phe Ser Glu His Lys Ser Val Arg Asn Ala 
100 105 110 

Thr Asp Ser Leu Thr Phe Phe Leu Val Thr Asp His Phe Leu Asp Asn 
115 120 125 

Gin Asp Ser Gin 
130 



<210> 231 

<211> 45 

<212> PRT 

< 2 1 3 > Homo sap i ens 

<400> 231 

Gly Trp Thr Arg Glu Asn Asp His 
1 5 

Gly Ser Ala Glu lie Gin Pro Ser 
20 

Asp Leu Gly Lys Pro Trp Ala Gly 
35 40 



Arg Ala Leu Ser Lys Ala Gly lie 
10 15 

Asn Leu Arg Val Gly Ser Ala Lys 
25 30 

Lys Leu Leu Leu Leu 
45 



<210> 232 

<211> 46 

<212> PRT 

<213> Homo sapiens 
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<400> 232 

Ser Ser Cys Leu Leu Phe Phe Ser Leu Gly Val Leu Tyr Arg Gly Gin 
15 10 15 

Met Leu Ala Pro Pro Leu Gin Glu Asp Trp Lys Gly Gly Val Lys Asp 
20 25 30 

Ser Asp Leu lie Asp Asp Ser Ser Ala Ser Pro lie Pro Pro 
35 40 45 



<210> 233 
<211> 41 
<212> PRT 

<213> Homo sapiens 
<400> 233 

Ser Tyr Leu Glu Tyr Lys Ala Ala Leu Tyr Pro Phe Ser Glu His Lys 
15 10 15 

Ser Val Arg Asn Ala Thr Asp Ser Leu Thr Phe Phe Leu Val Thr Asp 
20 25 30 

His Phe Leu Asp Asn Gin Asp Ser Gin 
35 40 



<210> 234 
<211> 11 
<212> PRT 

<213> Homo sapiens 
<400> 234 

Leu Lys Phe His Gin Glu Ser Leu Ser Gly Asp 
1 5 -10 



<210> 235 
<211> 25 
<212> PRT 

<213> Homo sapiens 
<400> 235 

Glu Ala Lys Ser Arg Pro Val Thr Gin Ala Gly Val Gin Trp His Asp 
15 10 15 

Leu Gly Ser Leu Gin Pro Leu Pro Pro 
20 25 



<210> 236 

<211> 25 

<212> PRT 

<213> Homo sapiens 
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<400> 236 

Glu Ala Lys Ser Arg Pro Val Thr Gin Ala Gly Val Gin Trp His Asp 
15 10 15 

Leu Gly Ser Leu Gin Pro Leu Pro Pro 
20 25 



<210> 237 
<211> 137 
<212> PRT 

<213> Homo sapiens 
<400> 237 

Ala Leu Val Leu Val Cys Arg Gin Arg Tyr Cys Arg Pro Arg Asp Leu 
15 10 15 

Leu Gin Arg Tyr Asp Ser Lys Pro lie Val Asp Leu lie Gly Ala Met 
20 25 30 

Glu Thr Gin Ser Glu Pro Ser Glu Leu Glu Leu Asp Asp Val Val lie 
35 40 45 

Thr Asn Pro His lie Glu Ala lie Leu Glu Asn Glu Asp Trp lie Glu 
50 55 60 

Asp Ala Ser Gly Leu Met Ser His Cys lie Ala lie Leu Lys lie Cys 
65 70 75 80 

His Thr Leu Thr Glu Lys Leu Val Ala Met Thr Met Gly Ser Gly Ala 
85 90 95 

Lys Met Lys Thr Ser Ala Ser Val Ser Asp lie lie Val Val Ala Lys 
100 105 110 

Arg Xle Ser Pro Arg Val Asp Asp Val Val Lys Ser Met Tyr Pro Pro 
115 120 125 

Leu Asp Pro Lys Leu Leu Asp Ala Arg 
130 135 



<210> 238 
<211> 319 
<212> PRT 

<213> Homo sapiens 
<400> 238 

Asp Val Glu Ser Arg Gly Pro Ser Ala Arg Cys Leu Pro Val Val Pro 
15 10 15 

Gly Ser Leu Leu Pro Gly Leu Glu Pro Ala Thr Lys Leu Met Pro Gly 
20 25 30 
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Gly Leu Ala Pro Gly His Gly Ala Pro Val Arg Glu ' Leu Leu Leu Pro 
35 40 45 

Leu Leu Ser Gin Pro Thr Leu Gly Ser Leu Trp Asp Ser Leu Arg His 
50 55 60 

Cys Ser Leu Leu Cys Asn Pro Leu Ser Cys Val Pro Ala Leu Glu Ala 
65 70 75 80 

Pro Pro Ser Leu Val Ser Leu Gly Cys Ser Gly Gly Cys Pro Arg Leu 
85 90 95 

Ser Leu Ala Gly Ser Ala Ser Pro Phe Pro Phe Leu Thr Ala Leu Leu 
100 105 110 

Ser Leu Leu Asn Thr Leu Ala Gin lie His Lys Gly Leu Cys Gly Gin 
115 120 125 

Leu Ala Ala lie Leu Ala Ala Pro Gly Leu Gin Asn Tyr Phe Leu Gin 
130 135 140 

Cys Val Ala Pro Gly Ala Ala Pro His Leu Thr Pro Phe Ser Ala Trp 
145 150 155 160 

Ala Leu Arg His Glu Tyr His Leu Gin Tyr Leu Ala Leu Ala Leu Ala 
165 170 175 

Gin Lys Ala Ala Ala Leu Gin Pro Leu Pro Ala Thr His Ala Ala Leu 
180 185 190 

Tyr His Gly Met Ala Leu Ala Leu Leu Ser Arg Leu Leu Pro Gly Ser 
195 200 205 

Glu Tyr Leu Thr His Glu Leu Leu Leu Ser Cys Val Phe Arg Leu Glu 
210 215 220 

Phe Leu Pro Glu Arg Thr Ser Gly Gly Pro Glu Ala Ala Asp Phe Ser 
225 230 235 240 

Asp Gin Leu Ser Leu Gly Ser Ser Arg Val Pro Arg Cys Gly Gin Gly 
245 250 255 

Thr Leu Leu Ala Gin Ala Cys Gin Asp Leu Pro Ser He Arg Asn Cys 
260 265 270 

Tyr Leu Thr His Cys Ser Pro Ala Arg Ala Ser Leu Leu Ala Ser Gin 
275 280 285 

Ala Leu His Arg Gly Glu Leu Gin Arg Val Pro Thr Leu Leu Leu Pro 
290 295 300 

Met Pro Thr Glu Pro Leu Leu Pro Thr Asp Trp Pro Phe Leu His 
305 310 315 
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<210> 239 

<211> 44 

<212> PRT 

<213> Homo sapiens 

<400> 239 

Asp Val Glu Ser Arg Gly Pro Ser Ala Arg Cys Leu Pro Val Val Pro 
15 10 15 

Gly Ser Leu Leu Pro Gly Leu Glu Pro Ala Thr Lys Leu Met Pro Gly 
20 25 30 

Gly Leu Ala Pro Gly His Gly Ala Pro Val Arg Glu 
35 40 



<210> 240 

<211> 45 

<212> PRT 

<213> Homo sapiens 

<400> 240 

Leu Leu Leu Pro Leu Leu Ser Gin Pro Thr Leu Gly Ser Leu Trp Asp 
15 10 15 

Ser Leu Arg His Cys Ser Leu Leu Cys Asn Pro Leu Ser Cys Val Pro 
20 25 30 

Ala Leu Glu Ala Pro Pro Ser Leu Val Ser Leu Gly Cys 
35 40 45 



<210> 241 

<211> 45 

<212> PRT 

<213> Homo sapiens 

<400> 241 

Ser Gly Gly Cys Pro Arg Leu Ser Leu Ala Gly Ser Ala Ser Pro Phe 
15 10 15 

Pro Phe Leu Thr Ala Leu Leu Ser Leu Leu Asn Thr Leu Ala Gin lie 
20 25 30 

His Lys Gly Leu Cys Gly Gin Leu Ala Ala lie Leu Ala 
35 40 45 



<210> 242 

<211> 44 

<212> PRT 

<213> Homo sapiens 

<400> 242 

Ala Pro Gly Leu Gin Asn Tyr Phe Leu Gin Cys Val Ala Pro Gly Ala 
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15 10 15 

Ala Pro His Leu Thr Pro Phe Ser Ala Trp Ala Leu Arg His Glu Tyr 
20 25 30 

His Leu Gin Tyr Leu Ala Leu Ala Leu Ala Gin Lys 
35 40 



<210> 243 
<211> 44 
<212> PRT 

<213> Homo sapiens 
<400> 243 

Ala Ala Ala Leu Gin Pro Leu Pro Ala Thr His Ala Ala Leu Tyr His 
15 10 15 

Gly Met Ala Leu Ala Leu Leu Ser Arg Leu Leu Pro Gly Ser Glu Tyr 
20 25 30 

Leu Thr His Glu Leu Leu Leu Ser Cys Val Phe Arg 
35 40 



<210> 244 
<211> 44 
<212> PRT 

<213> Homo sapiens 
<400> 244 

Leu Glu Phe Leu Pro Glu Arg Thr Ser Gly Gly Pro Glu Ala Ala Asp 
15 10 15 

Phe Ser Asp Gin Leu Ser Leu Gly Ser Ser Arg Val Pro Arg Cys Gly 
20 25 30 

Gin Gly Thr Leu Leu Ala Gin Ala Cys Gin Asp Leu 
35 40 



<210> 245 
<211> 53 
<212> PRT 

<213> Homo sapiens 



<400> 245 
Pro Ser lie Arg 
1 

Ser Leu Leu Ala 
20 

Pro Thr Leu Leu 
35 



Asn Cys Tyr Leu 
5 

Ser Gin Ala Leu 



Leu Pro Met Pro 
40 



Thr His Cys Ser 
10 

His Arg Gly Glu 
25 

Thr Glu Pro Leu 



Pro Ala Arg Ala 
15 

Leu Gin Arg Val 
30 

Leu Pro Thr Asp 
45 
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Trp Pro Phe Leu His 
50 



<210> 246 

<211> 25 

<212> PRT 

<213> Homo sapiens 

<400> 246 

Val Gly Ser Val Leu Gly Ala Phe 
1 5 

Ala Gin Thr His Arg Asp Ala Leu 
20 



Leu Thr Phe Pro Gly Leu Arg Leu 
10 15 

Thr 
25 



<210> 247 

<211> 65 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (21) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<220> 

<221> SITE 
<222> (37) 

<22 3> Xaa equals any of the naturally occurring L-amino acids 
<220> 

<221> SITE 
<222> (48) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<220> 

<221> SITE 
<222> (57) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<400> 247 

Leu Glu Cys Thr Asp Thr lie Met Val His Cys Ser Leu Lys Leu Leu 
15 10 15 

Ser Pro Ser Asp Xaa Ser His Ser Ala Ser Gin Val Ala Lys Thr Arg 
20 25 30 

Gly Val His His Xaa Thr Gin Leu lie Phe Lys Val Phe Phe Val Xaa 
35 40 45 

Met Gly Ser His Ser Thr Lys Tyr Xaa Ser lie Arg Pro Gly Leu Leu 
50 55 60 
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Pro 
65 



<210> 248 
<211> 14 
<212> PRT 

<2 1 3 > Homo sapiens 
<400> 248 

Glu Ser Ser Phe Val Pro Pro Ala Ala His Ser Ser Leu Cys 
15 10 



<210> 249 
<211> 172 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> SITE 
<222> (72) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<400> 249 

Leu Leu Pro Gly Gin Gin Glu Ala Thr Gin Cys Val Glu Ala Gly Ala 
15 10 15 

Gly Glu Gly Ala Leu Thr Pro Met Cys Pro Cys Arg Gin Glu Gin Phe 
20 25 30 

Val Asp Leu Tyr Lys Glu Phe Glu Pro Ser Leu Val Asn Ser Thr Val 
35 40 45 

Tyr lie Met Ala Met Ala lie Gin Met Ala Pro Phe Ala lie Asn Tyr 
50 55 60 

Lys Val Arg Pro Gly Pro Cys Xaa Asn lie His Cys Leu Pro Thr Gin 
65 70 75 80 

Pro His Pro Met Lys Pro Ser Val Pro His Pro His Arg Ala Arg Pro 
85 90 95 

Ser Trp Arg Ala Cys Pro Arg Thr Ser Pro Trp Cys Gly Val Trp Gin 
100 105 110 

Phe His Ser Trp Pro Ser Leu Ala Cys Ser Ser Ala Pro Arg Pro Thr 
115 120 125 

Ser Thr Ala Ser Leu Ala Ser Trp Thr Ser Leu Trp Ser Ser Ser Trp 
130 135 140 



Ser Leu Pro Arg Ser Cys Ser Trp Thr Ser Ala Trp Arg Ser Trp Pro 
145 150 155 160 
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Thr Ala Ser Cys Ser Ser Ser Trp Gly Pro Arg Ser 
165 170 



<210> 250 

<211> 45 

<212> PRT 

<213> Homo sapiens 

<400> 250 

Leu Leu Pro Gly Gin Gin Glu Ala Thr Gin Cys Val Glu Ala Gly Ala 
15 10 15 

Gly Glu Gly Ala Leu Thr Pro Met Cys Pro Cys Arg Gin Glu Gin Phe 
20 25 30 

Val Asp Leu Tyr Lys Glu Phe Glu Pro Ser Leu Val Asn 
35 40 45 



<210> 251 

<211> 44 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (27) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<400> 251 

Ser Thr Val Tyr lie Met Ala Met Ala lie Gin Met Ala Pro Phe Ala 
15 10 15 

lie Asn Tyr Lys Val Arg Pro Gly Pro Cys Xaa Asn He His Cys Leu 
20 25 30 

Pro Thr Gin Pro His Pro Met Lys Pro Ser Val Pro 
35 40 



<210> 252 

<211> 42 

<212> PRT 

<213> Homo sapiens 

<400> 252 

His Pro His Arg Ala Arg Pro Ser Trp Arg Ala Cys Pro Arg Thr Ser 
15 10 15 

Pro Trp Cys Gly Val Trp Gin Phe His Ser Trp Pro Ser Leu Ala Cys 
20 25 30 

Ser Ser Ala Pro Arg Pro Thr Ser Thr Ala 
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35 40 



<210> 253 
<211> 41 
<212> PRT 

<213> Homo sapiens 
<400> 253 

Ser Leu Ala Ser Trp Thr Ser Leu Trp Ser Ser Ser Trp Ser Leu Pro 
15 10 15 

Arg Ser Cys Ser Trp Thr Ser Ala Trp Arg Ser Trp Pro Thr Ala Ser 
20 25 30 

Cys Ser Ser Ser Trp Gly Pro Arg Ser 
35 40 



<210> 254 
<211> 48 
<212> PRT 

<213> Homo sapiens 
<400> 254 

Thr Arg Asn lie Leu Ser Phe lie Lys Cys Val lie His Asn Phe Trp 
1.5 10 15 

lie Pro Lys Glu Ser Asn Glu lie Thr lie lie lie Asn Pro Tyr Arg 
20 25 30 

Glu Thr Val Cys Phe Ser Val Glu Pro Val Lys Lys lie Phe Asn Tyr 
35 40 45 



<210> 255 
<211> 27 
<212> PRT 

<213> Homo sapiens 
<400> 255 

Leu Val Val Leu Phe Ala Ser Ser Asn Ser Arg Tyr Leu Lys Tyr Phe 
15 10 15 

Phe Leu Val Pro Leu lie Leu Gly Ser Ala Trp 
20 25 



<210> 256 
<211> 20 
<212> PRT 

<213> Homo sapiens 
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<400> 256 

His Glu Trp Lys Cys Lys Gin Lys Tyr Ser Glu Gly Ser Gly Asn Thr 
15 10 15 

Arg lie Gly Asn 
20 



<210> 257 

<211> 20 

<212> PRT 

<213> Homo sapiens 

<400> 257 

Leu Leu Pro Leu Cys Phe Leu Gly Pro Arg Gin Val Leu Glu Glu Phe 
15 10 15 

Pro Ser lie Val 
20 



<210> 258 

<211> 12 

<212> PRT 

<213> Homo sapiens 

<400> 258 

Pro Thr Arg Pro Ser Lys His Gin Glu Ala Gly Ser 
15 10 



<210> 259 

<211> 42 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (39) 

<22 3> Xaa equals any of the naturally occurring L-amino acids 
<400> 259 

Gly Gin Gly Pro Ala Gly Arg Trp Val Arg Arg Leu Pro Cys Ser Arg 
15 10 15 

Arg Ala Gly Gly Glu Arg Gly Pro His Trp Gly Val Trp Ala Gly Pro 
20 25 30 

Gin Met Ser Cys Gly Leu Xaa Phe Gly Pro 
35 40 



<210> 260 
<211> 193 
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<212> PRT 

<213> Homo sapiens 
<400> 260 

Trp Arg Thr Gin Gly Pro Met Val Leu Leu Trp Val Val Thr Cys Pro 
1 5 10 15 

Ala Thr Met Leu Thr Glu Pro Gin Asn Pro His Leu lie Gly Phe Val 
20 25 30 

Ala Tyr Ser Gly Pro Ser His Thr Thr Gin Pro His Lys Tyr Trp Leu 
35 40 45 

Leu Leu Asp Gly Gin Ala Asp Pro Ala Ala Ala Glu Gly Pro Val Lys 
50 55 60 

Arg Lys Ala Ala Ser Val Val Trp Trp Pro Gin Ala Leu Arg His Leu 
65 70 75 80 

Ser Leu Leu Val His Cys Trp Glu Glu Ser Tyr Glu Met Asn lie Gly 
85 90 95 

Cys Gin Ser Leu Trp Ala Gly Gly Leu Ala Ser Ser Gly Asn Gly Trp 
100 105 110 

Asp Leu Gly Val Ala Phe Arg Arg Asp Thr Cys Met Ser Ser Ser Ser 
115 120 125 

Leu His Trp Lys Glu Phe Lys Tyr Ala Pro Gly Ser Leu His Tyr Phe 
130 135 140 

Ala Leu Ser Phe Val Leu lie Leu Thr Glu lie Cys Leu Val Ser Ser 
145 150 155 160 

Gly Met Gly Phe Pro Gin Glu Gly Lys His Phe Ser Val Leu Gly Ser 
165 170 175 

Pro Asp Cys Ser Leu Trp Gly Arg Asp Glu His Val Pro Arg Glu Phe 
180 185 190 

Ala 



<210> 261 
<211> 42 
<212> PRT 

<213> Homo sapiens 
<400> 261 

Trp Arg Thr Gin Gly Pro Met Val Leu Leu Trp Val Val Thr Cys Pro 
15 10 15 

Ala Thr Met Leu Thr Glu Pro Gin Asn Pro His Leu lie Gly Phe Val 
20 25 30 
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Ala Tyr Ser Gly Pro Ser His Thr Thr Gin 
35 40 



<210> 262 
<211> 41 
<212> PRT 

<213> Homo sapiens 
<400> 262 

Pro His Lys Tyr Trp Leu Leu Leu Asp Gly Gin Ala Asp Pro Ala Ala 
1 5 10 15 

Ala Glu Gly Pro Val Lys Arg Lys Ala Ala Ser Val Val Trp Trp Pro 
20 25 30 

Gin Ala Leu Arg His Leu Ser Leu Leu 
35 40 



<210> 263 

<211> 41 

<212> PRT 

<213> Homo sapiens 

<400> 263 

Val His Cys Trp Glu Glu Ser Tyr Glu Met Asn He Gly Cys Gin Ser 
15 10 15 

Leu Trp Ala Gly Gly Leu Ala Ser Ser Gly Asn Gly Trp Asp Leu Gly 
20 25 30 

Val Ala Phe Arg Arg Asp Thr Cys Met 
35 40 



<210> 264 

<211> 44 

<212> PRT 

<213> Homo sapiens 

<400> 264 

Ser Ser Ser Ser Leu His Trp Lys Glu Phe Lys Tyr Ala Pro Gly Ser 
15 10 15 

Leu His Tyr Phe Ala Leu Ser Phe Val Leu He Leu Thr Glu He Cys 
20 25 30 

Leu Val Ser Ser Gly Met Gly Phe Pro Gin Glu Gly 
35 40 



<210> 265 
<211> 25 
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<212> PRT 

<213> Homo sapiens 
<400> 265 

Lys His Phe Ser Val Leu Gly Ser Pro Asp Cys Ser Leu Trp Gly Arg 
1 5 10 15 

Asp Glu His Val Pro Arg Glu Phe Ala 
20 25 



<210> 266 

<211> 31 

<212> PRT 

<213> Homo sapiens 

<400> 266 

lie Ala Gin Gly Thr Val Pro Leu Thr Lys Arg Gly Val Gin Ser Ser 
15 10 15 

Gly Pro Asp Tyr Pro Glu Gly Thr Leu Thr Pro Leu Pro Arg Gly 
20 25 30 



<210> 267 

<211> 31 

<212> PRT 

<213> Homo sapiens 

<400> 267 

lie Ala Gin Gly Thr Val Pro Leu Thr Lys Arg Gly Val Gin Ser Ser 
15 10 15 

Gly Pro Asp Tyr Pro Glu Gly Thr Leu Thr Pro Leu Pro Arg Gly 
20 25 30 



<210> 268 
<211> 28 
<212> PRT 

<213> Homo sapiens 
<400> 268 

Asp Cys Leu Tyr Leu Ala Leu Ser Phe Pro Trp His Cys His Cys His 
15 10 15 

His His Pro Pro Ser Gly Ser Leu Leu Tyr Pro Phe 
20 25 



<210> 269 
<211> 101 
<212> PRT 
<213> Homo sapiens 
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<400> 269 

Ala Ser Leu Pro Pro Ser Arg Ser Arg Pro Leu Ala Asn Met Ala Leu 
1 5 . 10 15 

Val Pro Cys Gin Val Leu Arg Met Ala lie Leu Leu Ser Tyr Cys Ser 
20 25 30 

* lie Leu Cys Asn Tyr Lys Ala lie Glu Met Pro Ser His Gin Thr Tyr 
35 40 45 

Gly Gly Ser Trp Lys Phe Leu Thr Phe lie Asp Leu Val lie Gin Ala 
50 55 60 

Val Phe Phe Gly lie Cys Val Leu Thr Asp Leu Ser Ser Leu Leu Thr 
65 70 75 80 

Arg Gly Ser Gly Asn Gin Glu Gin Glu Arg Gin Leu Lys Lys Leu lie 
85 90 95 

Ser Leu Arg Asp Trp 
100 



<210> 270 

<211> 16 

<212> PRT 

<213> Homo sapiens 

<400> 270 

Met Ser Arg Ser Ser Arg lie Ser Gly Leu Ser Cys Pro Trp Leu Leu 
15 10 15 



<210> 271 

<211> 45 

<212> PRT 

<213> Homo sapiens 

<400> 271 

Asp His Trp Pro Ala Gly Phe Leu Pro Pro Ala Pro Gly Leu Lys Phe 
1 5 10 15 

Pro Val Ala Leu Glu Val Phe Arg Lys Val Leu Pro Ala Val Cys Pro 
20 25 30 

Thr Asp Cys Ser Gly Ser Ala Gly Lys Glu Arg Asn Ser 
35 40 45 



<210> 272 
<211> 47 
<212> PRT 
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<213> Homo sapiens 
<400> 272 

Glu Glu lie Ala Thr Ser lie Glu Pro lie Arg Asp Phe Leu Ala He 
15 10 15 

Val Phe Phe Ala Ser He Gly Leu His Val Phe Pro Thr Phe Val Ala 
20 25 30 

Tyr Glu Leu Thr Val Leu Val Phe Leu Thr Leu Ser Val Val Val 
35 40 45 



<210> 273 
<211> 7 
<212> PRT 

<213> Homo sapiens 
<400> 273 

Tyr Cys Asn Leu Gin Cys Arg 
1 5 



<210> 274 
<211> 44 
<212> PRT 

<213> Homo sapiens 
<400> 274 

Ser Ala Leu lie Gly Asn Pro Lys Gly Cys Phe Gly Cys Phe Ser Pro 
15 10 15 

Val Val Leu Arg Glu Trp Ser Val Glu Ser Trp Lys Ser Leu Arg Pro 
20 25 30 

Phe Gin Ala He Cys Lys Leu Lys Thr Asn Phe Arg 
35 40 



<210> 275 

<211> 8 

<212> PRT 

<213> Homo sapiens 

<400> 275 

His Glu Ala Ala Leu Arg Gly Pro 
1 5 



<210> 276 
<211> 26 
<212> PRT 

<213> Homo sapiens 
<400> 276 
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Ser Asn Ala Ala Gly Asn Val Val Arg Ala Phe Leu Tyr lie Asn His 
15 10 15 

Leu Lys Leu Gly Cys Lys Val Gly Leu Ala 
20 25 



<210> 277 . 

<211> 25 

<212> PRT 

<213> Homo sapiens 

<400> 277 

Asn Trp Ala Val Leu Asn Met Leu Leu Ser Lys Gly Lys lie Thr lie 
15 10 15 

Phe Leu Gly Pro Leu Glu Cys Gly Ser 
20 25 



<210> 278 

<211> 49 

<212> PRT 

<213> Homo sapiens 

<400> 278 

Pro Ser His Gin Thr Arg Lys Gly Lys Ser Ala Lys Leu Leu Asp Arg 
15 10 15 

Pro Pro Glu Ala Leu Arg Met Lys lie lie Thr Thr Thr Leu Leu Leu 
20 25 30 

Ala Cys His Leu Gin Leu Glu Val Gly Val Val Val Gly Gly Glu Val 
35 40 45 

Asp 



<210> 279 
<211> 51 
<212> PRT 

<213> Homo sapiens 
<400> 279 

Phe Gin Ala Ser Ser Ala Asn Asn Gin Gin Asn Trp Gly Ser Gin Pro 
15 10 15 

lie Ala Gin Gin Pro Leu Gin Gin Gly Gly Asp Tyr Ser Gly Asn Tyr 
20 25 30 

Gly Tyr Asn Asn Asp Asn Gin Glu Phe Tyr Gin Asp Thr Tyr Gly Gin 
35 40 45 

Gin Trp Lys 
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50 



<210> 280 
<211> 264 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> SITE 
<222> (2) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<220> 

<221> SITE 
<222> (6) 

<22 3> Xaa equals any of the naturally occurring L-amino acids 
<220> 

<221> SITE 
<222> (14) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<400> 280 

Trp Xaa Pro Leu Leu Xaa Thr Ser Gly Ser Pro Gly Leu Xaa Gly Phe 
15 10 15 

Gly Thr Arg Met Asn Gly Lys Glu lie Glu Gly Glu Glu lie Glu He 
20 25 30 

Val Leu Ala Lys Pro Pro Asp Lys Lys Arg Lys Glu Arg Gin Ala Ala 
35 40 45 

Arg Gin Ala Ser Arg Ser Thr Ala Tyr Glu Asp Tyr Tyr Tyr His Pro 
50 55 60 

Pro Pro Arg Met Pro Pro Pro He Arg Gly Arg Gly Arg Gly Gly Gly 
65 70 75 80 

Arg Gly Gly Tyr Gly Tyr Pro Pro Asp Tyr Tyr Gly Tyr Glu Asp Tyr 
85 90 95 

Tyr Asp Asp Tyr Tyr Gly Tyr Asp Tyr His Asp Tyr Arg Gly Gly Tyr 
100 105 110 

Glu Asp Pro Tyr Tyr Gly Tyr Asp Asp Gly Tyr Ala Val Arg Gly Arg 
115 120 125 

Gly Gly Gly Arg Gly Gly Arg Gly Ala Pro Pro Pro Pro Arg Gly Arg 
130 135 140 

Gly Ala Pro Pro Pro Arg Gly Arg Ala Gly Tyr Ser Gin Arg Gly Ala 
145 150 155 160 



Pro Leu Gly Pro Pro Arg Gly Ser Arg Gly Gly Arg Gly Gly Pro Ala 
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165 

Gin Gin Gin Arg Gly 
180 

Gly Asn Val Gly Gly 
195 

Ser Lys Arg Arg Gin 
210 

Ser Leu Ser Ser Arg 
225 

Val Thr He Met Thr 
245 

Ser Gly Ser Arg Gin 
260 



170 

Arg Gly Ser Arg Gly Ser 
185 

Lys Arg Lys Ala Asp Gly 
200 

Pro Thr Thr Asn Arg Thr 
215 

Phe Ser Lys Val Val Thr 
230 235 

Thr Arg Asn Phe He Arg 
250 

Val Arg Ala 



175 

Arg Gly Asn Arg Gly 
190 

Tyr Asn Gin Pro Asp 
205 

Gly Val Pro Asn Pro 
220 

He Leu Val Thr Met 
240 

He Leu Met Gly Asn 
255 



<210> 281 

<211> 27 

<212> PRT 

<213> Homo sapiens 

<400> 281 

Arg Met Asn Gly Lys Glu He Glu Gly Glu Glu He Glu He Val Leu 
15 10 15 

Ala Lys Pro Pro Asp Lys Lys Arg' Lys Glu Arg 
20 25 



<210> 282 

<211> 25 

<212> PRT 

<213> Homo sapiens 

<400> 282 

Tyr Tyr His Pro Pro Pro Arg Met 
1 5 

Arg Gly Gly Gly Arg Gly Gly Tyr 
20 



Pro Pro Pro He Arg Gly Arg Gly 
10 15 

Gly 
25 



<210> 283 

<211> 26 

<212> PRT 

<213> Homo sapiens 

<400> 283 

Asp Tyr Arg Gly Gly Tyr Glu Asp Pro Tyr Tyr Gly Tyr Asp Asp Gly 
15 10 15 
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Tyr Ala Val Arg Gly Arg Gly Gly Gly Arg 
20 25 



<210> 284 
<211> 28 
<212> PRT 

<213> Homo sapiens 
<400> 284 

Pro Pro Pro Arg Gly Arg Ala Gly Tyr Ser Gin Arg Gly Ala Pro Leu 
15 10 15 

Gly Pro Pro Arg Gly Ser Arg Gly Gly Arg Gly Gly 
20 25 



<210> 285 

<211> 35 

<212> PRT 

<213> Homo sapiens 

<400> 285 

Ala Asp Gly Tyr Asn Gin Pro Asp Ser Lys Arg Arg Gin Pro Thr Thr 
1 5 10 15 

Asn Arg Thr Gly Val Pro Asn Pro Ser Leu Ser Ser Arg Phe Ser Lys 
20 25 30 

Val Val Thr 
35 



<210> 286 
<211> 19 
<212> PRT 

<213> Homo sapiens 
<400> 286 

Leu Gin lie Pro Pro Ser Ser Gin Ser Leu Gly Leu Lys Asn Ala Asp 
15 10 15 

Ser Ser lie 



<210> 287 
<211> 129 
<212> PRT 
<213> Homo sapiens 

<400> 287 

Gly Gly Pro Pro Glu Ser Ala Pro Trp Leu Pro Ala Val Leu Arg Ala 
15 10 15 



BNSDOCID: <WO 9947540A1 J_> 



WO 99/47540 



PCT/US99/05804 



152 



Pro Val Leu Thr 
20 

Trp Phe Cys Gin 
35 

His Cys lie Leu 
50 

Ser Met Trp Thr 
65 

Thr Gly Ala Ser 



Pro Leu Pro Leu 
100 

Leu Ser Leu Glu 
115 

Gly 



Ser Arg Cys Ala 



Pro Gly Ser Gly 
40 

Gly Pro Gly Ser 
55 

Pro Ser Val Pro 
70 

Ser Cys Ser Val 
85 

His Asn His Gin 



His Val Pro Gly 
120 



Ser Ser Asp Ser 
25 

Pro Ser Ser Thr 



Ser Cys Leu Cys 
60 

Gly Trp Pro Gin 
75 

Phe Ser Ala Asn 
90 

Arg Gin Ala Ser 
105 

Glu Ser Tyr Phe 



Glu Gly Pro Val 
30 

Glu Met Ser Cys 
45 

Val Leu Arg Gly 



Pro Ala Lys Glu 
80 

Asn Gly Ser Cys 
95 

Leu Asp Thr Gly 
110 

Tyr Ser Pro Val 
125 



<210> 288 

<211> 34 

<212> PRT 

<213> Homo sapiens 

<400> 288 

Ser Ser Asp Ser Glu Gly Pro Val Trp Phe Cys Gin Pro Gly Ser Gly 
15 10 15 

Pro Ser Ser Thr Glu Met Ser Cys His Cys lie Leu Gly Pro Gly Ser 
20 25 30 

Ser Cys 



<210> 289 

<211> 28 

<212> PRT 

<213> Homo sapiens 

<400> 289 

Trp Thr Pro Ser Val Pro Gly Trp Pro Gin Pro Ala Lys Glu Thr Gly 
15 10 15 

Ala Ser Ser Cys Ser Val Phe Ser Ala Asn Asn Gly 
20 25 
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<210> 290 
<211> 21 
<212> PRT 

<213> Homo sapiens 
<400> 290 

Gin Arg Gin Ala Ser Leu Asp Thr Gly Leu Ser Leu Glu His Val Pro 
15 10 15 



Gly Glu Ser Tyr Phe 
20 



<210> 291 
<211> 29 
<212> PRT 

<213> Homo sapiens 
<400> 291 

Ser Ser Ser Leu Val Leu Thr lie Arg Ser Gin Thr Leu Phe Leu Ala 
1 5 10 15 

Ser Phe lie His Ser Thr Ser lie Phe Cys Ala Leu Asn 
20 25 



<210> 292 
<211> 12 
<212> PRT 

<213> Homo sapiens 
<400> 292 

Cys Cys Cys Arg Leu Gly Leu Ser Gly Pro Lys Cys 
15 10 



<210> 293 
<211> 22 
<212> PRT 

<213> Homo sapiens 
<400> 293 

Arg Ala Phe Trp Gly Leu Gly Ala Leu Gin Leu Leu Asp Leu Ser Ala 
15 10 15 

Asn Gin Leu Glu Ala Leu 
20 



<210> 294 
<211> 34 
<212> PRT 

<213> Homo sapiens 
<400> 294 
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His Ala Ser Gly Arg Arg Thr Gly Ser Ala Asp Asp Gly Leu Gin Gly 
15 10 15 

Arg Thr Gly Ser Gly Pro Pro Thr Ala Gly Ala Gly Gly Gly Gly Ala 
20 25 30 

Ala Pro 



<210> 295 
<211> 205 
<212> PRT 
<213> Homo sapiens 

<400> 295 

Val Ser Ala Ala Ala Gly Ala Arg Leu Ala Pro Arg Ala Pro Gly Ala 
15 10 15 

Pro Ala Gly Cys Arg Pro Met Arg Gly Cys Ala Ala Arg Ala Ala Ala 
20 25 30 

Arg Lys Ser Leu Val Pro Val Leu Pro Ala Gly Trp Arg Ser Gly Pro 
35 40 45 

Ala Ala Ala Ala Arg Pro Gly Pro Arg Arg Leu Ala His Ala Pro Ser 
50 55 60 

Ala Ala Arg Ser Arg Ala Gly Pro Gly Ala Val Ala Arg Pro Leu Pro 
65 70 75 "80 

Arg Arg His Leu Ala Ala Ala His Gly Arg Gly Cys Gly Pro Ala Ala 
85 90 95 

Ala Arg Ala Gly Ala Gly Ser Gly Pro Gly Ala Arg Arg Ala Ala Arg 
100 105 110 

Val Pro Thr Ala Gly Arg Pro Pro Gly Thr His Val His Thr Ser Gly 
115 120 125 

Gin Ser Gly Ala Pro Arg Asp Pro Glu Gly Glu Ala Leu Ala Asp Thr 
130 135 140 

Trp Ala Gin Thr Gly Gin Gly Asp Ser Ser Ser Asn Ser Ser Ser Ser 
145 150 155 160 

Gly Arg Gly Arg Asp Gin Glu Gly Pro Arg Met Gly Ala Ala Pro Pro 
165 170 175 

Pro Pro Ala Pro Ala Val Gly Gly Pro Leu Pro Val Arg Pro Trp Ser 
180 185 190 

Pro Ser Ser Ala Glu Pro Val Leu Arg Pro Asp Ala Trp 
195 200 205 
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<210> 296 
<211> 368 
<212> PRT 
<213> Homo sapiens 

<400> 296 

Thr Arg Pro Ala Ala Glu Arg Ala Pro Arg Thr Thr Gly Ser Arg Asp 
1 5 10 15 

Ala Gin Ala Ala Gly Leu Pro Pro Arg Val Pro Gly Ala Gly Gly Leu 
20 25 30 

Pro Pro Cys Gly Ala Leu Pro Gly Arg Gly Leu Gly Arg Cys Cys Cys 
35 40 45 

Cys Cys Cys Cys Cys Arg Leu Gly Leu Ser Gly Pro Lys Cys Arg Pro 
50 55 60 

Gly Pro Arg Pro Arg Gly Pro Trp Ala Pro Arg Thr Ala Pro Arg Cys 
65 70 75 80 

Ala Arg Ala Cys Arg Glu Ala Cys Gin Leu Ser Ala Leu Ser Leu Pro 
85 90 95 

Ala Val Pro Pro Gly Leu Ser Leu Arg Leu Arg Ala Leu Leu Leu Asp 
100 105 110 

His Asn Arg Val Arg Ala Leu Pro Pro Gly Ala Phe Ala Gly Ala Gly 
115 120 125 

Ala Leu Gin Arg Leu Asp Leu Arg Glu Asn Gly Leu His Ser Val His 
130 135 140 

Val Arg Ala Phe Trp Gly Leu Gly Ala Leu Gin Leu Leu Asp Leu Ser 
145 150 155 160 

Ala Asn Gin Leu Glu Ala Leu Ala Pro Gly Thr Phe Ala Pro Leu Arg 
165 170 175 

Ala Leu Arg Asn Leu Ser Leu Ala Gly Asn Arg Leu Ala Arg Leu Glu 
180 185 190 

Pro Ala Ala Leu Gly Ala Leu Pro Leu Leu Arg Ser Leu Ser Leu Gin 
195 200 205 

Asp Asn Glu Leu Ala Ala Leu Ala Pro Gly Leu Leu Gly Arg Leu Pro 
210 215 220 

Ala Leu Asp Ala Leu His Leu Arg Gly Asn Pro Trp Gly Cys Gly Cys 
225 230 235 240 

Ala Leu Arg Pro Leu Cys Ala Trp Leu Arg Arg His Pro Leu Pro Ala 
245 250 255 
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Ser Glu Ala Glu Thr Val Leu Cys 
260 

Ser Pro Leu Thr Ala Phe Ser Asp 
275 280 

Pro Leu Ala Leu Arg Asp Leu Ala 
290 295 



Val Trp Pro Gly Arg Leu Thr Leu 
265 270 

Ala Ala Phe Ser His Cys Ala Gin 
285 

Arg Gly Leu His Ala Arg Ala Gly 
300 



Leu Leu Pro Arg Gin Pro Gly Phe Leu Pro Gly Ala Gly Leu Trp Ala 

305 310 315 320 

His Arg Leu Pro Cys Ala Pro Pro Pro Pro Pro His Arg Arg Pro Pro 

325 330 335 



Pro Ala Glu Thr Val Gin Thr Arg 
340 

Val Pro Arg Pro Arg Thr Arg Gly 
355 360 



Thr Pro lie Pro Thr Pro Thr Ala 
345 350 

Ala Pro Ser Ala Ala Ala Gin Ala 
365 



<210> 297 

<211> 47 

<212> PRT 

<213> Homo sapiens 

<400> 297 

Gly Cys Arg Pro Met Arg Gly Cys Ala Ala Arg Ala Ala Ala Arg Lys 
15 10 15 

Ser Leu Val Pro Val Leu Pro Ala Gly Trp Arg Ser Gly Pro Ala Ala 
20 25 30 

Ala Ala Arg Pro Gly Pro Arg Arg Leu Ala His Ala Pro Ser Ala 
35 40 45 



<210> 298 

<211> 30 

<212> PRT 

<213> Homo sapiens 

<400> 298 

Pro Gly Ala Val Ala Arg Pro Leu Pro Arg Arg His Leu Ala Ala Ala 
15 10 15 

His Gly Arg Gly Cys Gly Pro Ala Ala Ala Arg Ala Gly Ala 
20 25 30 



<210> 299 
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<211> 24 
<212> PRT 

<213> Homo sapiens 
<400> 299 

Ser Gly Gin Ser Gly Ala Pro Arg Asp Pro Glu Gly Glu Ala Leu Ala 
15 10 15 

Asp Thr Trp Ala Gin Thr Gly Gin 
20 



<210> 300 

<211> 23 

<212> PRT 

<213> Homo sapiens 

<400> 300 

Pro Pro Ala Pro Ala Val Gly Gly Pro Leu Pro Val Arg Pro Trp Ser 
15 10 15 

Pro Ser Ser Ala Glu Pro Val 
20 



<210> 301 
<211> 26 
<212> PRT 

<213> Homo sapiens 
<400> 301 

Ala Pro Arg Thr Thr Gly Ser Arg Asp Ala Gin Ala Ala Gly Leu Pro 
15 10 15 

Pro Arg Val Pro Gly Ala Gly Gly Leu Pro 
20 25 



<210> 302 
<211> 22 
<212> PRT 

<213> Homo sapiens 
<400> 302 

Gly Pro Arg Pro Arg Gly Pro Trp Ala Pro Arg Thr Ala Pro Arg Cys 
15 10 15 

Ala Arg Ala Cys Arg Glu 
20 



<210> 303 
<211> 31 
<212> PRT 

<213> Homo sapiens 



1 

WO 99/47540 
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<400> 303 

Ala Val Pro Pro Gly Leu Ser Leu 
1 5 

His Asn Arg Val Arg Ala Leu Pro 
20 



Arg Leu Arg Ala Leu Leu Leu Asp 
10 15 

Pro Gly Ala Phe Ala Gly Ala 
25 30 



<210> 304 

<211> 24 

<212> PRT 

<213> Homo sapiens 

<400> 304 

Leu Gly Ala Leu Gin Leu Leu Asp Leu Ser Ala Asn Gin Leu Glu Ala 
15 10 15 

Leu Ala Pro Gly Thr Phe Ala Pro 
20 



<210> 305 

<211> 36 

<212> PRT 

<213> Homo sapiens 

<400> 305 

Pro Pro Gly Ala Phe Ala Gly Ala Gly Ala Leu Gin Arg Leu Asp Leu 
15 10 15 

Arg Glu Asn Gly Leu His Ser Val His Val Arg Ala Phe Trp Gly Leu 
20 25 30 

Gly Ala Leu Gin 
35 



<210> 306 

<211> 28 

<212> PRT 

<2 1 3 > Homo sapiens 

<400> 306 

Arg Asn Leu Ser Leu Ala Gly Asn Arg Leu Ala Arg Leu Glu Pro Ala 
15 10 15 

Ala Leu Gly Ala Leu Pro Leu Leu Arg Ser Leu Ser 
20 25 



<210> 307 

<211> 26 

<212> PRT 

<213> Homo sapiens 
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<400> 307 

Leu Pro Ala Leu Asp Ala Leu His Leu Arg Gly Asn Pro Trp Gly Cys 
15 10 15 

Gly Cys Ala Leu Arg Pro Leu Cys Ala Trp 
20 25 



<210> 308 

<211> 34 

<212> PRT 

<213> Homo sapiens 

<400> 308 

Thr Val Leu Cys Val Trp Pro Gly Arg Leu Thr Leu Ser Pro Leu Thr 
15 10 15 

Ala Phe Ser Asp Ala Ala Phe Ser His Cys Ala Gin Pro Leu Ala Leu 
20 25 30 

Arg Asp 



<210> 309 

<211> 24 

<212> PRT 

<213> Homo sapiens 

<400> 309 

Leu His Ala Arg Ala Gly Leu Leu Pro Arg Gin Pro Gly Phe Leu Pro 
1 5 10 15 

Gly Ala Gly Leu Trp Ala His Arg 
20 



<210> 310 

<211> 24 

<212> PRT 

<213> Homo sapiens 

<400> 310 

Thr Val Gin Thr Arg Thr Pro He Pro Thr Pro Thr Ala Val Pro Arg 
15 10 15 

Pro Arg Thr Arg Gly Ala Pro Ser 
20 



<210> 311 

<211> 59 

<212> PRT 

< 2 1 3 > Homo sapiens 
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<400> 311 
His Ala Ser Gly 
1 

Gly Leu Pro Cys 
20 

Cys Arg Leu Cys 
35 

Leu Cys Ser Asp 
50 



Arg Pro Asp Arg 
5 

Pro Asp Leu Glu 



Ala Pro Thr Glu 
40 

Arg Cys Asp Thr 
55 



Ser Ser Ala Pro 
10 

Pro Leu Gly Gly 
25 

Ala Arg Gly Leu 



Trp Arg Ser 



lie Gly Asn Ser 
15 

Leu Gin Ser Lys 
30 

Trp Ser Arg Ser 
45 



<210> 312 

<211> 29 

<212> PRT 

<213> Homo sapiens 

<400> 312 

Gly Leu Pro Cys Pro Asp Leu Glu Pro Leu Gly Gly Leu Gin Ser Lys 
15 10 15 

Cys Arg Leu Cys Ala Pro Thr Glu Ala Arg Gly Leu Trp 
20 25 



<210> 313 

<211> 16 

<212> PRT 

<213> Homo sapiens 

<400> 313 

Gin Glu Trp Glu Ser Glu Leu Gly Glu Arg Arg Lys Pro Leu Gin Ala 
15 10 15 



<210> 314 

<211> 46 

<212> PRT 

<213> Homo sapiens 

<400> 314 

Cys Gin Ser Ser Asn Leu lie Phe Phe Gin Phe Val Asn lie Leu Phe 
15 10 15 

Asn Leu Met Met Asp lie Leu Val Asp Phe Ser lie Thr Lys Met Pro 
20 25 30 

lie Asn Ser lie Phe Ser Leu Tyr Phe Cys Tyr Glu lie lie 
35 40 45 
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<210> 315 
<211> 134 
<212> PRT 
<213> Homo sapiens 

<400> 315 

Gly Pro Val Trp Leu Phe Cys Phe Leu Thr Leu Cys Arg Lys Pro Ser 
1 5 10 15 

Gin Leu Phe Ser Gin Glu Asn Ser Cys Met Asp Val Ala Gly Gly Val 
20 25 30 

Thr Thr Cys Leu Pro Pro Trp Phe Ser Arg Gly Ala Pro Ala Gin Met 
35 40 45 

Ser Gin Trp Pro Pro Ser Ser Asp His Gly Ala Val Arg Ala Gly Arg 
50 55 60 

Asp Ser Arg Val Gly Pro Val Gin Pro Ser His Leu Thr Cys Glu Gly 
65 70 75 80 

Gly Lys Glu Glu Arg Glu Lys Asn Lys Lys Ala Glu Val Asn Pro Pro 
85 90 95 

Thr Gly Met Gly Leu Ala Asn Arg lie Pro Arg Asp Asp lie Thr Leu 
100 105 110 

Lys Leu Arg Asn Gin Gly Lys Leu Arg Thr Lys Glu Asn Arg Thr Gin 
115 120 125 

Ser Ala Lys Arg His Pro 
130 



<210> 316 

<211> 42 

<212> PRT 

<213> Homo sapiens 

<400> 316 

Val Ala Cys Lys Pro Glu Asn Arg Thr Lys Thr His Phe Ala Ser Ser 
15 10 15 

Pro Ala Cys Asp Gly His Ala Leu Gly Gly Gin Val Gly Phe Ala lie 
20 25 30 

Cys Phe Leu Ser Cys Leu Phe Pro Pro Met 
35 40 



<210> 317 
<211> 40 
<212> PRT 
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<213> Homo sapiens 
<400> 317 

Ser His Pro Met Pro Asn Thr Pro Gin Lys Gin Leu Leu Phe Ser Glu 
15 10 15 

Asp Asn Glu Leu Leu Val Ser Leu Arg Thr Gly Arg Lys Pro Thr Leu 
20 25 30 

Gin Ala Ala Leu Arg Val Thr Gly 
35 40 



<210> 318 

<211> 59 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (26) 

<22 3> Xaa equals any of the naturally occurring L-amino acids 
<400> 318 

Glu Gly Asp Pro Arg Gly Arg Pro Arg Pro Arg Pro Leu Gly Pro Pro 
15 10 15 

Pro Gin Leu Thr Leu Pro Thr Ala Leu Xaa Asp lie Leu Arg Gin Val 
20 25 30 

Arg Ala Pro Gly Leu Arg Leu Ser Arg Ala Leu Glu Val Gly Arg Lys 
35 40 45 

Gly Ser Pro lie Phe Lys lie Gin lie Tyr Leu 
50 55 



<210> 319 
<211> 250 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> SITE 
<222> (145) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<400> 319 

Ala His Arg Leu Gin lie Arg Leu Leu Thr Trp Asp Val Lys Asp Thr 
15 10 15 

Leu Leu Arg Leu Arg His Pro Leu Gly Glu Ala Tyr Ala Thr Lys Ala 
20 25 30 

Arg Ala His Gly Leu Glu Val Glu Pro Ser Ala Leu Glu Gin Gly Phe 
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35 40 45 

Arg Gin Ala Tyr Arg Ala Gin Ser His Ser Phe Pro Asn Tyr Gly Leu 
50 55 60 

Ser His Gly Leu Thr Ser Arg Gin Trp Trp Leu Asp Val Val Leu Gin 
65 70 75 80 

Thr Phe His Leu Ala Gly Val Gin Asp Ala Gin Ala Val Ala Pro He 
85 90 95 

Ala Glu Gin Leu Tyr Lys Asp Phe Ser His Pro Cys Thr Trp Gin Val 
100 105 110 

Leu Asp Gly Ala Glu Asp Thr Leu Arg Glu Cys Arg Thr Arg Gly Leu 
115 120 125 

Arg Leu Ala Val He Ser Asn Phe Asp Arg Arg Leu Glu Gly He Leu 
130 135 140 

Xaa Gly Leu Gly Leu Arg Glu His Phe Asp Phe Val Leu Thr Ser Glu 
145 150 155 160 

Ala Ala Gly Trp Pro Lys Pro Asp Pro Arg He Phe Gin Glu Ala Leu 
165 170 175 

Arg Leu Ala His Met Glu Pro Val Val Ala Ala His Val Gly Asp Asn 
180 185 190 

Tyr Leu Cys Asp Tyr Gin Gly Pro Arg Ala Val Gly Met His Ser Phe 
195 200 205 

Leu Val Val Gly Pro Gin Ala Leu Asp Pro Val Val Arg Asp Ser Val 
210 215 220 

Pro Lys Glu His He Leu Pro Ser Leu Ala His Leu Leu Pro Ala Leu 
225 230 235 240 

Asp Cys Leu Glu Gly Ser Thr Pro Gly Leu 
245 250 



<210> 320 

<211> 27 

<212> PRT 

<213> Homo sapiens 

<400> 320 

He Arg Leu Leu Thr Trp Asp Val Lys Asp Thr Leu Leu Arg Leu Arg 
15 10 15 

His Pro Leu Gly Glu Ala Tyr Ala Thr Lys Ala 
20 25 
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<210> 321 

<211> 24 

<212> PRT 

<213> Homo sapiens 

<400> 321 

Leu Glu Gin Gly Phe Arg Gin Ala Tyr Arg Ala Gin Ser His Ser Phe 
15 10 15 

Pro Asn Tyr Gly Leu Ser His Gly 
20 



<210> 322 

<211> 26 

<212> PRT 

<213> Homo sapiens 

<400> 322 

His Leu Ala Gly Val Gin Asp Ala Gin Ala Val Ala Pro lie Ala Glu 
15 10 15 

Gin Leu Tyr Lys Asp Phe Ser His Pro Cys 
20 25 



<210> 323 

<211> 23 

<212> PRT 

<213> Homo sapiens 

<400> 323 

Val Leu Asp Gly Ala Glu Asp Thr Leu Arg Glu Cys Arg Thr Arg Gly 
15 10 15 

Leu Arg Leu Ala Val lie Ser 
20 



<210> 324 

<211> 26 

<212> PRT 

<213> Homo sapiens 

<400> 324 

Arg Glu His Phe Asp Phe Val Leu Thr Ser Glu Ala Ala Gly Trp Pro 
15 10 15 

Lys Pro Asp Pro Arg lie Phe Gin Glu Ala 
20 25 



<210> 325 
<211> 28 
<212> PRT 
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<213> Homo sapiens 
<400> 325 

Glu Pro Val Val Ala Ala His Val Gly Asp Asn Tyr Leu Cys Asp Tyr 
15 10 15 

Gin Gly Pro Arg Ala Val Gly Met His Ser Phe Leu 
20 25 



<210> 326 
<211> 23 
<212> PRT 

<213> Homo sapiens 
<400> 326 

Val Val Arg Asp Ser Val Pro Lys Glu His lie Leu Pro Ser Leu Ala 
1 5 10 15 

His Leu Leu Pro Ala Leu Asp 
20 



<210> 327 

<211> 22 

<212> PRT 

<213> Homo sapiens 

<400> 327 

lie Arg Lys Leu Gly Pro Gly Leu Ala Pro Cys Ser Cys Arg Ser Gly 
15 10 15 

Gin Val Phe Pro Arg Val 
20 



<210> 328 

<211> 241 

<212> PRT 

<213> Homo sapiens 

<400> 328 

Lys Pro Leu Arg Met Ala Arg Pro Gly Gly Pro Glu His Asn Glu Tyr 
15 10 15 

Ala Leu Val Ser Ala Trp His Ser Ser Gly Ser Tyr Leu Asp Ser Glu 
20 25 30 

Gly Leu Arg His Gin Asp Asp Phe Asp Val Ser Leu Leu Val Cys His 
35 40 45 

Cys Ala Ala Pro Phe Glu Glu Gin Gly Glu Ala Glu Arg His Val Leu 
50 55 60 

Arg Leu Gin Phe Phe Val Val Leu Thr Ser Gin Arg Glu Leu Phe Pro 
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65 

Arg Leu Thr Ala 



Pro Glu Pro Glu 
100 

Ser Gly Leu lie 
115 

Ala Ala Glu Val 
130 

Leu Ala Gly Gly 
145 

Leu Leu Glu Pro 



Ala Leu Ala Glu 
180 

lie Gly Asp lie 
195 

Ser Trp Tyr Gin 
210 

Ala Val Ala lie 
225 



70 

Asp Met Arg Arg 
85 

Ala Pro Gly Ser 



Leu Ala Pro Gly 
120 

Gly Met Ala Arg 
135 

His Cys Arg Arg 
150 

Pro Gly Pro Asp 
165 

Leu Glu Glu Leu 



Asp Pro Gin Leu 
200 

Ser Leu lie Lys 
215 

Ser Lys Ala Gin 
230 



75 

Phe Arg Lys Pro 
90 

Ser Ala Gly Ser 
105 

Pro Ala Pro Leu 



Ala Arg Leu Ala 
140 

Asp Thr Leu Trp 
155 

Arg Leu Arg Leu 
170 

Leu Glu Ala Val 
185 

Asp Cys Phe Leu 



Val Leu Leu Ser 
220 

Thr Trp Glu Leu 
235 



80 

Pro Arg Leu Pro 
95 

Pro Gly Glu Ala 
110 

Phe Pro Pro Leu 
125 

Gin Leu Val Arg 



Lys Arg Leu Phe 
160 

Gly Gly Arg Leu 
175 

His Ala Lys Ser 
190 

Ser Met Thr Val 
205 

Arg Phe Pro Arg 



Ser Thr Trp Leu 
240 



Arg 



<210> 329 
<211> 30 
<212> PRT 

<213> Homo sapiens 
<400> 329 

Ala Arg Gly Thr Leu Glu Leu Pro Thr Pro Leu lie Ala Ala His Gin 
15 10 15 

Leu Tyr Asn Tyr Val Ala Asp His Ala Ser Ser Tyr His Met 
20 25 30 



<210> 330 

<211> 37 

<212> PRT 

<213> Homo sapiens 

<400> 330 
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Ser His Cys Glu Trp Pro Gly Gin Gly Ala Gin Asn Thr Thr Ser Met 
15 10 15 

Pro Trp Cys Arg His Gly Thr Val Leu Ala Pro Thr Trp Thr Leu Arg 
20 25 30 

Asp Phe Asp Thr Arg 
35 



<210> 331 

<211> 91 

<212> PRT 

<213> Homo sapiens 

<400> 331 

Pro Leu Thr Thr Val Ser His Leu Cys Pro Leu Ser Leu Arg Val Phe 
15 10 15 

Thr Ser His Leu Asp He Thr Ala Gly His Ser His Arg Asp Asp Thr 
20 25 30 

Trp Val Pro He Pro Ala Leu Pro Leu Lys His Leu Arg Pro Pro Ser 
35 40 45 

Ser Pro Phe Ala Leu Gly Pro Trp Val Ser His Pro Leu Met Arg Trp 
50 55 60 

Val Gin Lys Leu Ser His Leu His Ser Asn Pro Gly Thr Gly Phe Ser 
65 70 75 80 

Met Gly Gly Lys Ser Ala Glu Lys Leu Lys Cys 
85 90 



<210> 332 
<211> 179 
<212> PRT 
<213> Homo sapiens 

<400> 332 

Ser Thr Ala Ala Arg Gly Ala Pro Gly Pro Gly Arg Ala Gly Gly Thr 
15 10 15 

Pro Arg Ser Ser Pro Cys Gin He His Trp Gly His Arg Pro Pro Ala 
20 25 30 

Gly Leu Leu Pro He His Asp Gly Leu Leu Val Pro Glu Pro Asp Gin 
35 40 45 

Ser Ser Pro Lys Pro Leu Pro Gin Ser Cys Arg His Phe Gin Ser Pro 
50 55 60 

Asp Leu Gly Thr Gin Tyr Leu Val Ala Leu Asn Gin Lys Phe Thr Asp 
65 70 75 80 
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Cys Ser Ala Leu Val 
85 

Val Val Phe Arg Glu 
100 

Pro Pro Ala Gin Leu 
115 

Asn Thr Ala Cys Phe 
130 

Asp Trp Thr Thr Glu 
145 

Leu Pro Ala Arg Gly 
165 

Ala Arg His 



Phe Trp Thr Pro Leu Arg 
90 

Ala Leu Pro Val Gin Pro 
105 

Val Ser Thr Tyr His His 
120 

Thr Leu Leu Asp Pro Pro 
135 

Cys His Cys Ser Leu Asn 
150 155 

Arg Thr Asp Gin Pro Phe 
170 



Lys Asp Val Ser Glu 
95 

Gin Asp Thr Arg Ser 
110 

Leu Glu Ser Val lie 
125 

Pro Leu Lys Gly Val 
140 

His Gly Pro Thr Arg 
160 

Trp Ala Pro Gly Gin 
175 



<210> 333 
<211> 56 
<212> PRT 

<213> Homo sapiens 
<400> 333 

His Gin Arg Leu Cys Asn Tyr Val Leu Arg Val Cys Cys Pro Ser Leu 
15 10 15 

Ala Ala Gly Thr Ala Leu Pro Lys His Pro Gin Pro Leu Thr His Pro 
20 25 30 

Gly Leu Gin Arg Val Arg Ser Thr Pro Arg Thr Pro Trp Ala Leu Leu 
35 40 45 

Gly Tyr Ser Phe Arg Pro Pro Trp 
50 55 



<210> 334 

<211> 28 

<212> PRT 

<213> Homo sapiens 

<400> 334 

Pro Gly Gly Pro Glu His Asn Glu Tyr Ala Leu Val Ser Ala Trp His 
15 10 15 

Ser Ser Gly Ser Tyr Leu Asp Ser Glu Gly Leu Arg 
20 25 
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<210> 335 
<211> 25 
<212> PRT 

<213> Homo sapiens 
<400> 335 

Asp Val Ser Leu Leu Val Cys His Cys Ala Ala Pro Phe Glu Glu Gin 
15 10 15 

Gly Glu Ala Glu Arg His Val Leu Arg 
20 25 



<210> 336 
<211> 28 
<212> PRT 

<213> Homo sapiens 
<400> 336 

Arg Leu Thr Ala Asp Met Arg Arg Phe Arg Lys Pro Pro Arg Leu Pro 
15 10 15 

Pro Glu Pro Glu Ala Pro Gly Ser Ser Ala Gly Ser 
20 25 



<210> 337 
<211> 25 
<212> PRT 

<213> Homo sapiens 
<400> 337 

Gly Glu Ala Ser Gly Leu lie Leu Ala Pro Gly Pro Ala Pro Leu Phe 
15 10 15 

Pro Pro Leu Ala Ala Glu Val Gly Met 
20 25 



<210> 338 

<211> 23 

<212> PRT 

<213> Homo sapiens 

<400> 338 

Thr Leu Trp Lys Arg Leu Phe Leu Leu Glu Pro Pro Gly Pro Asp Arg 
1 5 10 15 

Leu Arg Leu Gly Gly Arg Leu 
20 



<210> 339 
<211> 28 
<212> PRT 
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<213> Homo sapiens 
<400> 339 

Leu Ala Glu Leu Glu Glu Leu Leu Glu Ala Val His Ala Lys Ser lie 
15 10 15 

Gly Asp lie Asp Pro Gin Leu Asp Cys Phe Leu Ser 
20 25 



<210> 340 

<211> 197 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (97) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<400> 340 

Phe Gin Leu Tyr Phe Asn Pro Glu Leu lie Phe Lys His Phe Gin lie 
15 10 15 

Trp Arg Leu lie Thr Asn Phe Leu Phe Phe Gly Pro Val Gly Phe Asn 
20 25 30 

Phe Leu Phe Asn Met lie Phe Leu Tyr Arg Tyr Cys Arg Met Leu Glu 
35 40 45 

Glu Gly Ser Phe Arg Gly Arg Thr Ala Asp Phe Val Phe Met Phe Leu 
50 55 60 

Phe Gly Gly Phe Leu Met Thr Leu Phe Gly Leu Phe Val Ser Leu Val 
65 70 75 80 

Phe Leu Gly Gin Ala Phe Thr lie Met Leu Val Tyr Val Trp Ser Arg 
85 90 95 

Xaa Asn Pro Tyr Val Arg Met Asn Phe Phe Gly Leu Leu Asn Phe Gin 
100 105 110 

Ala Pro Phe Leu Pro Trp Val Leu Met Gly Phe Ser Leu Leu Leu Gly 
115 120 125 

Asn Ser lie lie Val Asp Leu Leu Gly lie Ala Val Gly His lie Tyr 
130 135 140 

Phe Phe Leu Glu Asp Val Phe Pro Asn Gin Pro Gly Gly lie Arg lie 
145 150 155 160 

Leu Lys Thr Pro Ser lie Leu Lys Ala lie Phe Asp Thr Pro Asp Glu 
165 170 175 

Asp Pro Asn Tyr Asn Pro Leu Pro Glu Glu Arg Pro Gly Gly Phe Ala 
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180 185 190 



Trp Gly Glu Gly Gin 
195 



<210> 341 
<211> 108 
<212> PRT 
<213> Homo sapiens 

<400> 341 

Gly Val Gly Gin Ala Thr Val Gly Lys Met Ala Tyr Gin Ser Leu Arg 
15 10 15 

Leu Glu Tyr Leu Gin lie Pro Pro Val Ser Arg Ala Tyr Thr Thr Ala 
20 25 30 

Cys Val Leu Thr Thr Ala Ala Val Gin Leu Glu Leu lie Thr Pro Phe 
35 40 45 

Gin Leu Tyr Phe Asn Pro Glu Leu lie Phe Lys His Phe Gin lie Trp 
50 55 60 

Arg Leu He Thr Asn Phe Leu Phe Phe Gly Pro Val Gly Phe Asn Phe 
65 70 75 80 

Leu Phe Asn Met He Phe Leu Tyr Arg Tyr Cys Arg Met Leu Glu Glu 
85 90 95 

Gly Ser Phe Arg Gly Arg Thr Ala Asp Phe Val Phe 
100 105 



<210> 342 

<211> 23 

<212> PRT 

<213> Homo sapiens 

<400> 342 

Leu He Phe Lys His Phe Gin lie Trp Arg Leu He Thr Asn Phe Leu 
15 10 15 

Phe Phe Gly Pro Val Gly Phe 
20 



<210> 343 

<211> 25 

<212> PRT 

<213> Homo sapiens 

<400> 343 

Phe Leu Tyr Arg Tyr Cys Arg Met Leu Glu Glu Gly Ser Phe Arg Gly 
15 10 15 
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Arg Thr Ala Asp Phe Val Phe Met Phe 
20 25 



<210> 344 

<211> 23 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (19) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<400> 344 

Leu Val Phe Leu Gly Gin Ala Phe Thr lie Met Leu Val Tyr Val Trp 
15 10 15 

Ser Arg Xaa Asn Pro Tyr Val 
20 



<210> 345 
<211> 21 
<212> PRT 

<213> Homo sapiens 
<400> 345 

Val Leu Met Gly Phe Ser Leu Leu Leu Gly Asn Ser lie lie Val Asp 
15 10 15 

Leu Leu Gly lie Ala 
20 



<210> 346 
<211> 25 
<212> PRT 

<213> Homo sapiens 
<400> 346 

Asn Gin Pro Gly Gly lie Arg lie Leu Lys Thr Pro Ser lie Leu Lys 
15 10 15 

Ala lie Phe Asp Thr Pro Asp Glu Asp 
20 25 



<210> 347 
<211> 28 
<212> PRT 

<213> Homo sapiens 
<400> 347 
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Arg Leu Glu Tyr Leu Gin lie Pro Pro Val Ser Arg Ala Tyr Thr Thr 
15 10 15 

Ala Cys Val Leu Thr Thr Ala Ala Val Gin Leu Glu 
20 25 



<210> 348 

<211> 31 

<212> PRT 

<213> Homo sapiens 

<400> 348 

Arg Leu lie Thr Asn Phe Leu Phe Phe Gly Pro Val Gly Phe Asn Phe 
15 10 15 

Leu Phe Asn Met lie Phe Leu Tyr Arg Tyr Cys Arg Met Leu Glu 
20 25 30 

<210> 349 

<211> 12 

<212> PRT 

<213> Homo sapiens 

<400> 349 

His Ala Ser Ala Gly Pro Asp Gly Ser Ser Pro Ala 
15 10 



<210> 350 
<211> 115 
<212> PRT 
<213> Homo sapiens 

<400> 350 

Glu Leu Leu Leu Glu Lys Pro Lys Pro Trp Gin Pro Pro Ala Ala Ala 
15 10 15 

Pro His Arg Ala Leu Leu Val Leu Cys Tyr Ser lie Val Glu Asn Thr 
20 25 30 

Cys lie lie Thr Pro Thr Ala Lys Ala Trp Lys Tyr Met Glu Glu Glu 
35 40 45 

lie Leu Gly Phe Gly Lys Ser Val Cys Asp Ser Leu Gly Arg Arg His 
50 55 60 

Met Ser Thr Cys Ala Leu Cys Asp Phe Cys Ser Leu Lys Leu Glu Gin 
65 70 75 80 

Cys His Ser Glu Ala Ser Leu Gin Arg Gin Gin Cys Asp Thr Ser His 
85 90 95 

Lys Thr Pro Phe Ala Ala Pro Cys Leu Pro Pro Arg Ala Cys Pro Ser 
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100 



105 



110 



Ala Thr Arg 
115 



<210> 351 

<211> 77 

<212> PRT 

<213> Homo sapiens 

<400> 351 

Leu Pro Gly Trp Gly Phe Pro Thr Lys lie Cys Asp Thr Asp Tyr lie 
15 10 15 

Gin Tyr Pro Asn Tyr Cys Ser Phe Lys Ser Gin Gin Cys Leu Met Arg 
20 25 30 

Asn Arg Asn Arg Lys Val Ser Arg Met Arg Cys Leu Gin Asn Glu Thr 
35 40 45 

Tyr Ser Ala Leu Ser Pro Gly Lys Ser Glu Asp Val Val Leu Arg Trp 
50 55 60 

Ser Gin Glu Phe Ser Thr Leu Thr Leu Gly Gin Phe Gly 
65 70 75 



<210> 352 

<211> 65 

<212> PRT 

<213> Homo sapiens 

<400> 352 

Ser Pro Val Leu Leu Pro Ala Phe Pro Pro Leu Pro Val Pro Leu Leu 
15 10 15 

Ala Leu Pro Val Ser Ala Pro Leu Pro Ala Cys Val Leu Val Ser Ala 
20 25 30 

Pro Ala Cys Ala Pro Leu Leu Ala Pro Ala Cys Ala Leu Ala Leu Ala 
35 40 45 

Pro Gly Phe Pro Gly Thr Arg Arg lie Val Gly Ala Leu Pro Arg Cys 
50 55 60 

Cys 
65 



<210> 353 

<211> 35 

.<212> PRT 

<213> Homo sapiens 
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<400> 353 

Leu Leu Val Leu Cys Tyr Ser lie Val Glu Asn Thr Cys He He Thr 
1,5 10 15 

Pro Thr Ala Lys Ala Trp Lys Tyr Met Glu Glu Glu He Leu Gly Phe 
20 25 30 

Gly Lys Ser 
35 



<210> 354 
<211> 26 
<212> PRT 

<213> Homo sapiens 
<400> 354 

Leu Lys Leu Glu Gin Cys His Ser Glu Ala Ser Leu Gin Arg Gin Gin 
1 5 10 15 

Cys Asp Thr Ser His Lys Thr Pro Phe Ala 
20 25 



<210> 355 
<211> 40 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (27) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<400> 355 

Gin Val Ser Gly Leu He Leu Ser Leu Ser Cys Gly Met Asp Gly Leu 
15 10 15 

Ala Leu Asp Gly Ser Pro Ser Pro Ser Pro Xaa Thr Glu Lys Ala Gly 
20 25 30 

Arg Cys He Ser Gin Thr Ser Leu 
35 40 



<210> 356 
<211> 46 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (27) 

<223> Xaa equals any of the naturally occurring L-amino acids 
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<400> 356 
Gin Val Ser Gly 
1 

Ala Leu Asp Gly 
20 

Arg Cys lie Ser 
35 



Leu lie Leu Ser 
5 

Ser Pro Ser Pro 



Gin Thr Ser Leu 
40 



Leu Ser Cys Gly 
10 

Ser Pro Xaa Thr 
25 

Pro Gly Lys Trp 



Met Asp Gly Leu 
15 

Glu Lys Ala Gly 
30 

Glu Val 
45 



<210> 357 
<211> 173 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> SITE 
<222> (118) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<400> 357 

Arg Ala Ser Lys Thr Val Pro Arg Met Pro Pro Asn Trp Pro Ala Lys 
15 10 15 

Met Pro Cys Leu Cys His lie Arg Thr Val Glu His Leu Gly Thr lie 
20 25 30 

Ser Ser Gly Ala Pro Gly Arg Pro Thr Gly Gin Gin Ala Ala Arg Thr 
35 40 45 

Tyr His lie Cys Trp lie His Pro Gly Gin Lys lie Asp Ser Leu Pro 
50 55 60 

Pro Ser Ser Gin His Pro Arg Ser Gin Gin Leu Ala Pro Gly Thr Trp 
65 70 75 80 

Pro Ser Thr Ser Thr Thr Lys Pro Ala Glu Glu Thr Leu Gly Ser Ser 
85 90 95 

Ala Ser Leu Pro lie Ser Gin Ala Arg Lys Ser Glu Lys Cys Thr Phe 
100 105 110 

Gin Pro Ser Pro Trp Xaa Val Arg Gly Lys Glu Ser His Gin Val Pro 
115 120 125 

Ala His Pro Ser His Arg Thr Glu Thr Glu Ser Asp His Ser Pro Val 
130 135 140 

Arg Lys Pro Pro Ser Arg Gly Thr Arg Thr Gly Asp Phe Thr Val Gly 
145 150 155 160 

Asp Trp Ser Glu Ala Trp Leu Leu Glu Leu Ala Leu Leu 
165 170 
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<210> 358 

<211> 23 

<212> PRT 

<213> Homo sapiens 

<400> 358 

Arg Met Pro Pro Asn Trp Pro Ala Lys Met Pro Cys Leu Cys His lie 
1 5 10 15 

Arg Thr Val Glu His Leu Gly 
20 



<210> 359 
<211> 25 
<212> PRT 

<213> Homo sapiens 
<400> 359 

Gly Arg Pro Thr Gly Gin Gin Ala Ala Arg Thr Tyr His lie Cys Trp 
15 10 15 

lie His Pro Gly Gin Lys lie Asp Ser 
20 25 



<210> 360 

<211> 25 

<212> PRT 

<213> Homo sapiens 

<400> 360 

Trp Pro Ser Thr Ser Thr Thr Lys Pro Ala Glu Glu Thr Leu Gly Ser 
15 10 15 

Ser Ala Ser Leu Pro lie Ser Gin Ala 
20 25 



<210> 361 

<211> 23 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (13) 

<22 3> Xaa equals any of the naturally occurring L-amino acids 
<400> 361 

Lys Ser Glu Lys Cys Thr Phe Gin Pro Ser Pro Trp Xaa Val Arg Gly 
15 10 15 



Lys Glu Ser His Gin Val Pro 
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<210> 362 

<211> 24 

<212> PRT 

<213> Homo sapiens 

<400> 362 

Lys Pro Pro Ser Arg Gly Thr Arg Thr Gly Asp Phe Thr Val Gly Asp 
15 10 15 

Trp Ser Glu Ala Trp Leu Leu Glu 
20 



<210> 363 

<211> 10 

<212> PRT 

<213> Homo sapiens 

<400> 363 

Pro Cys Ala Asp Cys Leu Ser Ala Trp- Ala 
15 10 



<210> 364 

<211> 11 

<212> PRT 

<213> Homo sapiens 



<400> 364 

His Ala Ser Gly Tyr Leu Cys lie Val Leu Leu 
15 10 



<210> 365 

<211> 34 

<212> PRT 

<213> Homo sapiens 

<400> 365 

Asn Ser Ala Arg Ala Ala Arg Ala Glu lie Val Leu Gly Leu Leu Val 
15 10 15 

Trp Thr Leu lie Ala Gly Thr Glu Tyr Phe Arg Val Pro Ala Phe Gly 
20 25 30 

Trp Val 



<210> 366 
<211> 22 
<212> PRT 
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<213> Homo sapiens 
<400> 366 

Pro Cys Ser Pro Pro Asp Ser Pro Pro Leu Pro Gly Ala Phe Val Trp 
15 10 15 

Arg Val Leu Trp Val Cys 
20 



<210> 367 
<211> 25 
<212> PRT 

<213> Homo sapiens 



<400> 367 

Ala Arg Ala Cys Phe Ala Tyr Asn 
1 5 

Trp Asp Ser His Phe His Gly Ser 
20 



Gly Val Cys Ser Glu Gly Arg Cys 
10 15 

Val 
25 



<210> 368 
<211> 100 
<212> PRT 
< 2 1 3 > Homo s ap i ens 



<400> 368 
Met Ser Asn Met 
1 

Asn Lys Tyr lie 
20 

Lys Ser Thr Val 
35 

Lys Asn Lys Met 
50 

Gin lie Asp lie 
65 

Ser Lys Arg Tyr 



Gly Lys lie Pro 
5 

Cys Ser Arg lie 



Leu Gin lie Cys 
40 

Ser Asp His Ser 
55 

His Ser Leu Gly 
70 

Cys Thr Leu Leu 
85 



Ser Leu Ser Leu 
10 

Pro Lys Phe lie 
25 

Leu Lys Arg Gin 



Lys lie Gly Lys 
60 

lie Val Glu Thr 
75 

Thr Glu Gin Ser 
90 



His lie Pro lie 
15 

Gin Lys Val Asn 
30 

lie lie Leu Asn 
45 

Ala Asn Leu Val 



Gly Cys Val Pro 
80 

Gly Phe Pro Phe 
95 



Leu Ser His Pro 
100 



<210> 369 

<211> 84 

<212> PRT 

<213> Homo sapiens 
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<220> 

<221> SITE 

<222> (54) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<220> 

<221> SITE 

<222> (58) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<220> 

<221> SITE 

<222> (82) 

<22 3> Xaa equals any of the naturally occurring L-amino acids 

<400> 369 

Met Ala Gly Cys Cys Leu Lys Leu Phe Gly Val Leu Ser Leu Cys Phe 
1 5 10 15 

Leu Cys Gly Leu lie Ser lie Glu Arg Val lie Cys Asn Pro Val Ser 
20 25 30 

Ala Asp Phe Gin Val Ser Thr Phe Cys Gin Arg His Cys Leu Leu Arg 
35 40 45 

Ser Lys Val Met Phe Xaa lie Lys Gly Xaa Thr Ala Thr lie Glu Val 
50 55 60 

lie Asn Glu Asn Cys Thr Leu Val Ala Ala Pro Pro lie Gly Phe Pro 
65 70 75 80 

lie Xaa Phe Leu 



<210> 370 

<211> 49 

<212> PRT 

<213> Homo sapiens 

<400> 370 

Met Ser Asp His Ser Lys lie Gly Lys Ala Asn Leu Val Gin lie Asp 
15 10 15 

He His Ser Leu Gly He Val Glu Thr Gly Cys Val Pro Ser Lys Arg 
20 25 30 

Tyr Cys Thr Leu Leu Thr Glu Gin Ser Gly Phe Pro Phe Leu Ser His 
35 40 45 

Pro 
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<210> 371 
<211> 50 
<212> PRT 

<213> Homo sapiens 
<400> 371 

Met Ala Gly Cys Cys Leu Lys Leu Phe Gly Val Leu Ser Leu Cys Phe 
15 10 15 

Leu Cys Gly Leu lie Ser lie Glu Arg Val lie Cys Asn Pro Val Ser 
20 25 30 

Ala Asp Phe Gin Val Ser Thr Phe Cys Gin Arg His Cys Leu Leu Arg 
35 40 45 

Ser Lys 
50 



<210> 372 

<211> 34 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (4) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<220> 

<221> SITE 
<222> (8) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<220> 

<221> SITE 
<222> (32) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<400> 372 

Val Met Phe Xaa lie Lys Gly Xaa Thr Ala Thr lie Glu Val lie Asn 
15 10 15 

Glu Asn Cys Thr Leu Val Ala Ala Pro Pro lie Gly Phe Pro lie Xaa 
20 25 30 

Phe Leu 



<210> 373 

<211> 65 

<212> PRT 

<213> Homo sapiens 
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<400> 373 
Pro Thr Glu Gly 
1 

Ser Ala Leu Ala 
20 

Val Leu Ser Trp 
35 

Lys Asp Glu Val 
50 



Arg Gin Lys Val 
5 

Met Thr Lys Thr 

Tyr Thr Phe Leu 
40 

Lys Pro Lys lie 
55 



Leu Lys Thr Phe 
10 

Ser Thr Cys lie 
25 

Asn Tyr Tyr lie 



Leu Ala Asn Gly 
60 



Thr Val Pro Arg 
15 

Tyr His Phe Leu 
30 

Ser Gin Glu Gly 
45 

Ala Arg Trp Lys 



Tyr 
65 



<210> 374 
<211> 35 
<212> PRT 

<213> Homo sapiens 
<400> 374 

Pro Arg Ser Ala Leu Ala Met Thr Lys Thr Ser Thr Cys lie Tyr His 
15 10 15 

Phe Leu Val Leu Ser Trp Tyr Thr Phe Leu Asn Tyr Tyr lie Ser Gin 
20 25 30 

Glu Gly Lys 
35 



<210> 375 
<211> 24 
<212> PRT 

<213> Homo sapiens 
<400> 375 

Pro Thr Glu Gly Arg Gin Lys Val Leu Lys Thr Phe Thr Val Pro Arg 
15 10 15 

Ser Ala Leu Ala Met Thr Lys Thr 
20 



<210> 376 
<211> 27 
<212> PRT 

<213> Homo sapiens 
<400> 376 

Phe Leu Asn Tyr Tyr lie Ser Gin Glu Gly Lys Asp Glu Val Lys Pro 
15 10 15 
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Lys lie Leu Ala Asn Gly Ala Arg Trp Lys Tyr 
20 25 



<210> 377 

<211> 13 

<212> PRT 

<213> Homo sapiens 

<400> 377 

Phe Lys Asp Gin Leu Val Tyr Pro Leu Leu Ala Phe Thr 
15 10 



<210> 378 
<211> 13 
<212> PRT 

<213> Homo sapiens 
<400> 378 

Arg Gin Ala Leu Asn Leu Pro Asp Val Phe Gly Leu Val 
15 10 



<210> 379 
<211> 10 
<212> PRT 

<213> Homo sapiens 
<400> 379 

Ala Thr Ala Ser His Asp Leu Leu Leu Phe 
15 10 



<210> 380 
<211> 97 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (72) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<400> 380 

Met Ser lie Asn lie Cys Leu Met Gin Ser Lys Thr Gin Gly Ser Cys 
1 5 10 15 

Gin Tyr Leu Leu Leu Pro His Pro Val Pro lie lie Leu Lys Val Ser 
20 25 30 

Thr Val Phe Ser Leu Leu Ser Leu Phe Arg Leu Leu Phe Leu Ser Phe 
35 40 45 

Cys Pro His Pro Lys Lys Cys Ser Tyr Leu Leu -Lys Tyr Tyr Gly Pro 
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50 55 60 

Leu Glu Gly His Lys Thr Leu Xaa Tyr Leu Arg Thr Asn Leu Gly Val 
65 70 75 80 

lie Gin Pro Pro Leu Arg Met Tyr Ala Ala Glu Asp Cys Asn Gly lie 
85 90 95 

Gly 



<210> 381 
<211> 46 
<212> PRT 

<213> Homo sapiens 
<400> 381 

Met Ser lie Asn lie Cys Leu Met Gin Ser Lys Thr Gin Gly Ser Cys 
15 10 15 

Gin Tyr Leu Leu Leu Pro His Pro Val Pro lie lie Leu Lys Val Ser 
20 25 30 

Thr Val Phe Ser Leu Leu Ser Leu Phe Arg Leu Leu Phe Leu 
35 40 45 



<210> 382 

<211> 51 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (26) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<400> 382 

Ser Phe Cys Pro His Pro Lys Lys Cys Ser Tyr Leu Leu Lys Tyr Tyr 
1 5 10 15 

Gly Pro Leu Glu Gly His Lys Thr Leu Xaa Tyr Leu Arg Thr Asn Leu 
20 25 30 

Gly Val He Gin Pro Pro Leu Arg Met Tyr Ala Ala Glu Asp Cys Asn 
35 40 45 

Gly He Gly 
50 



<210> 383 
<211> 23 
<212> PRT 
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<213> Homo sapiens 
<400> 383 

Lys Glu Glu Asp Asp Asp Thr Glu Arg Leu Pro Ser Lys Cys Glu Val 
15 10 15 

Cys Lys Leu Leu Ser Thr Glu 
20 



<210> 384 
<211> 23 
<212> PRT 

<213> Homo sapiens 
<400> 384 

Lys Glu Glu Asp Asp Asp Thr Glu Arg Leu Pro Ser Lys Cys Glu Val 
15 10 15 

Cys Lys Leu Leu Ser Thr Glu 
20 



<210> 385 

<211> 19 

<212> PRT 

<213> Homo sapiens 

<400> 385 

Leu Gin Ala Glu Leu Ser Arg Thr Gly Arg Ser Arg Glu Val Leu Glu 
15 10 15 

Leu Gly Gin 



<210> 386 

<211> 19 

<212> PRT 

<213> Homo sapiens 

<400> 386 

Leu Gin Ala Glu Leu Ser Arg Thr Gly Arg Ser Arg Glu Val Leu Glu 
15 10 15 

Leu Gly Gin 



<210> 387 

<211> 12 

<212> PRT 

<213> Homo sapiens 

<400> 387 
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Arg Gin Ala Val lie Val Cys Arg Arg Arg Phe Val 
15 10 



<210> 388 
<211> 148 
<212> PRT 
<213> Homo sapiens 

<400> 388 

Pro Pro Arg Trp Ala His Pro Lys Ala Pro Glu Gly Ser Pro Asp Pro 
15 10 15 

Pro Ser Pro Pro Ser Ala Leu Gly Leu Ser Val Leu Pro Trp Ser Asp 
20 25 30 

Ser Asp Pro Trp His lie Ser Val Ser Pro Cys Ala Gin Arg Glu His 
35 40 45 

Tyr Ser Pro Gly Ser Ala His lie Asn Ser Leu Arg Pro Leu Pro Ala 
50 55 60 

Leu Ser Leu Lys Arg Cys Lys Ala Arg Val Ser Ser Ser Cys Leu Tyr 
65 70 75 80 

Pro Ala Pro Ala Pro Ala Pro Ala Pro Leu Glu lie Asp Arg Cys Asp 
85 90 95 

Ser Val Pro Pro Val Ala Leu Cys Ser Ala Ala Tyr Thr Leu Arg lie 
100 105 110 

Cys Trp Ala Ser Val Leu Cys His Arg Pro Pro Pro Ser Thr Ser Gin 
115 120 125 

Pro Lys Pro Arg Ala Arg Pro Lys Lys Gly Lys Ala lie Phe Pro Thr 
130 135 140 

Ala Gin Val Pro 
145 



<210> 389 

<211> 71 

<212> PRT 

<213> Homo sapiens 

<400> 389 

Pro Pro Arg Trp Ala His Pro Lys Ala Pro Glu Gly Ser Pro Asp Pro 
15 10 15 

Pro Ser Pro Pro Ser Ala Leu Gly Leu Ser Val Leu Pro Trp Ser Asp 
20 25 30 

Ser Asp Pro Trp His lie Ser Val Ser Pro Cys Ala Gin Arg Glu His 
35 40 45 
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Tyr Ser Pro Gly Ser Ala His lie Asn Ser Leu Arg Pro Leu Pro Ala 
50 55 60 

Leu Ser Leu Lys Arg Cys Lys 
65 70 



<210> 390 
<211> 77 
<212> PRT 

<213> Homo sapiens 
<400> 390 

Ala Arg Val Ser Ser Ser Cys Leu Tyr Pro Ala Pro Ala Pro Ala Pro 
1 5 10 15 

Ala Pro Leu Glu lie Asp Arg Cys Asp Ser Val Pro Pro Val Ala Leu 
20 25 30 

Cys Ser Ala Ala Tyr Thr Leu Arg lie Cys Trp Ala Ser Val Leu Cys 
35 40 45 

His Arg Pro Pro Pro Ser Thr Ser Gin Pro Lys Pro Arg Ala Arg Pro 
50 55 60 

Lys Lys Gly Lys Ala lie Phe Pro Thr Ala Gin Val Pro 
65 70 75 



<210> 391 
<211> 26 
<212> PRT 

<213> Homo sapiens 
<400> 391 

Glu Glu Lys Leu Phe Thr Ser Ala Pro Gly Arg Asp Phe Trp Val Met 
15 10 15 

Gly Glu Thr Arg Asp Gly Asn Glu Glu Asn 
20 25 



<210> 392 
<211> 42 
<212> PRT 

<213> Homo sapiens 
<400> 392 

Gin Lys Pro Thr Phe Ala Leu Gly Glu Leu Tyr Pro Pro Leu lie Asn 
1 5 10 15 

Leu Trp Glu Ala Gly Lys Glu Lys Ser Thr Ser Leu Lys Val Lys Ala 
20 25 30 
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Thr Val lie Gly Leu Pro Thr Asn Met Ser 
35 40 
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